Warning: Permanently added '3.92.206.196' (ED25519) to the list of known hosts. You can reproduce this build on your computer by running: sudo dnf install copr-rpmbuild /usr/bin/copr-rpmbuild --verbose --drop-resultdir --task-url https://copr.fedorainfracloud.org/backend/get-build-task/8753595-fedora-42-x86_64 --chroot fedora-42-x86_64 Version: 1.2 PID: 9007 Logging PID: 9008 Task: {'allow_user_ssh': False, 'appstream': False, 'background': False, 'build_id': 8753595, 'buildroot_pkgs': [], 'chroot': 'fedora-42-x86_64', 'enable_net': False, 'fedora_review': False, 'git_hash': 'de426df52037452b808bb91e9442bad70dec40e7', 'git_repo': 'https://copr-dist-git.fedorainfracloud.org/git/trix/F42/rccl', 'isolation': 'default', 'memory_reqs': 2048, 'package_name': 'rccl', 'package_version': '6.3.0-3', 'project_dirname': 'F42', 'project_name': 'F42', 'project_owner': 'trix', 'repo_priority': None, 'repos': [{'baseurl': 'https://download.copr.fedorainfracloud.org/results/trix/F42/fedora-42-x86_64/', 'id': 'copr_base', 'name': 'Copr repository', 'priority': None}], 'sandbox': 'trix/F42--trix', 'source_json': {}, 'source_type': None, 'ssh_public_keys': None, 'storage': 0, 'submitter': 'trix', 'tags': [], 'task_id': '8753595-fedora-42-x86_64', 'timeout': 18000, 'uses_devel_repo': False, 'with_opts': [], 'without_opts': []} Running: git clone https://copr-dist-git.fedorainfracloud.org/git/trix/F42/rccl /var/lib/copr-rpmbuild/workspace/workdir-26vvpt8i/rccl --depth 500 --no-single-branch --recursive cmd: ['git', 'clone', 'https://copr-dist-git.fedorainfracloud.org/git/trix/F42/rccl', '/var/lib/copr-rpmbuild/workspace/workdir-26vvpt8i/rccl', '--depth', '500', '--no-single-branch', '--recursive'] cwd: . rc: 0 stdout: stderr: Cloning into '/var/lib/copr-rpmbuild/workspace/workdir-26vvpt8i/rccl'... Running: git checkout de426df52037452b808bb91e9442bad70dec40e7 -- cmd: ['git', 'checkout', 'de426df52037452b808bb91e9442bad70dec40e7', '--'] cwd: /var/lib/copr-rpmbuild/workspace/workdir-26vvpt8i/rccl rc: 0 stdout: stderr: Note: switching to 'de426df52037452b808bb91e9442bad70dec40e7'. You are in 'detached HEAD' state. You can look around, make experimental changes and commit them, and you can discard any commits you make in this state without impacting any branches by switching back to a branch. If you want to create a new branch to retain commits you create, you may do so (now or later) by using -c with the switch command. Example: git switch -c Or undo this operation with: git switch - Turn off this advice by setting config variable advice.detachedHead to false HEAD is now at de426df automatic import of rccl Running: dist-git-client sources cmd: ['dist-git-client', 'sources'] cwd: /var/lib/copr-rpmbuild/workspace/workdir-26vvpt8i/rccl rc: 0 stdout: stderr: INFO: Reading stdout from command: git rev-parse --abbrev-ref HEAD INFO: Reading stdout from command: git rev-parse HEAD INFO: Reading sources specification file: sources INFO: Downloading RCCL-6.3.0.tar.gz INFO: Reading stdout from command: curl --help all INFO: Calling: curl -H Pragma: -o RCCL-6.3.0.tar.gz --location --connect-timeout 60 --retry 3 --retry-delay 10 --remote-time --show-error --fail --retry-all-errors https://copr-dist-git.fedorainfracloud.org/repo/pkgs/trix/F42/rccl/RCCL-6.3.0.tar.gz/md5/5c206e5849ada8ccab5151f79e191f5b/RCCL-6.3.0.tar.gz % Total % Received % Xferd Average Speed Time Time Time Current Dload Upload Total Spent Left Speed 100 1785k 100 1785k 0 0 13.5M 0 --:--:-- --:--:-- --:--:-- 13.6M INFO: Reading stdout from command: md5sum RCCL-6.3.0.tar.gz /usr/bin/tail: /var/lib/copr-rpmbuild/main.log: file truncated Running (timeout=18000): unbuffer mock --spec /var/lib/copr-rpmbuild/workspace/workdir-26vvpt8i/rccl/rccl.spec --sources /var/lib/copr-rpmbuild/workspace/workdir-26vvpt8i/rccl --resultdir /var/lib/copr-rpmbuild/results --uniqueext 1741782565.967003 -r /var/lib/copr-rpmbuild/results/configs/child.cfg INFO: mock.py version 6.1 starting (python version = 3.13.0, NVR = mock-6.1-1.fc41), args: /usr/libexec/mock/mock --spec /var/lib/copr-rpmbuild/workspace/workdir-26vvpt8i/rccl/rccl.spec --sources /var/lib/copr-rpmbuild/workspace/workdir-26vvpt8i/rccl --resultdir /var/lib/copr-rpmbuild/results --uniqueext 1741782565.967003 -r /var/lib/copr-rpmbuild/results/configs/child.cfg Start(bootstrap): init plugins INFO: tmpfs initialized INFO: selinux enabled INFO: chroot_scan: initialized INFO: compress_logs: initialized Finish(bootstrap): init plugins Start: init plugins INFO: tmpfs initialized INFO: selinux enabled INFO: chroot_scan: initialized INFO: compress_logs: initialized Finish: init plugins INFO: Signal handler active Start: run INFO: Start(/var/lib/copr-rpmbuild/workspace/workdir-26vvpt8i/rccl/rccl.spec) Config(fedora-42-x86_64) Start: clean chroot Finish: clean chroot Mock Version: 6.1 INFO: Mock Version: 6.1 Start(bootstrap): chroot init INFO: mounting tmpfs at /var/lib/mock/fedora-42-x86_64-bootstrap-1741782565.967003/root. INFO: calling preinit hooks INFO: enabled root cache INFO: enabled package manager cache Start(bootstrap): cleaning package manager metadata Finish(bootstrap): cleaning package manager metadata INFO: Guessed host environment type: unknown INFO: Using container image: registry.fedoraproject.org/fedora:42 INFO: Pulling image: registry.fedoraproject.org/fedora:42 INFO: Tagging container image as mock-bootstrap-44d6f86a-3374-494d-bfc9-4fc86b8cda35 INFO: Checking that 5121baae3d7cd791ad3e33d82fe0e734e8e54ae88820340f438c3087af8b8da7 image matches host's architecture INFO: Copy content of container 5121baae3d7cd791ad3e33d82fe0e734e8e54ae88820340f438c3087af8b8da7 to /var/lib/mock/fedora-42-x86_64-bootstrap-1741782565.967003/root INFO: mounting 5121baae3d7cd791ad3e33d82fe0e734e8e54ae88820340f438c3087af8b8da7 with podman image mount INFO: image 5121baae3d7cd791ad3e33d82fe0e734e8e54ae88820340f438c3087af8b8da7 as /var/lib/containers/storage/overlay/5444575277de4e2f85da3e3dec6ec9ee731492ee40ca0b1af5578729a71c7311/merged INFO: umounting image 5121baae3d7cd791ad3e33d82fe0e734e8e54ae88820340f438c3087af8b8da7 (/var/lib/containers/storage/overlay/5444575277de4e2f85da3e3dec6ec9ee731492ee40ca0b1af5578729a71c7311/merged) with podman image umount INFO: Removing image mock-bootstrap-44d6f86a-3374-494d-bfc9-4fc86b8cda35 INFO: Package manager dnf5 detected and used (fallback) INFO: Not updating bootstrap chroot, bootstrap_image_ready=True Start(bootstrap): creating root cache Finish(bootstrap): creating root cache Finish(bootstrap): chroot init Start: chroot init INFO: mounting tmpfs at /var/lib/mock/fedora-42-x86_64-1741782565.967003/root. INFO: calling preinit hooks INFO: enabled root cache INFO: enabled package manager cache Start: cleaning package manager metadata Finish: cleaning package manager metadata INFO: enabled HW Info plugin INFO: Package manager dnf5 detected and used (direct choice) INFO: Buildroot is handled by package management downloaded with a bootstrap image: rpm-4.20.0-8.fc42.x86_64 rpm-sequoia-1.7.0-5.fc42.x86_64 dnf5-5.2.10.0-2.fc42.x86_64 dnf5-plugins-5.2.10.0-2.fc42.x86_64 Start: installing minimal buildroot with dnf5 Updating and loading repositories: updates 100% | 110.1 KiB/s | 31.9 KiB | 00m00s fedora 100% | 58.7 MiB/s | 35.5 MiB | 00m01s Copr repository 100% | 1.5 MiB/s | 120.0 KiB | 00m00s Repositories loaded. Package Arch Version Repository Size Installing group/module packages: bash x86_64 5.2.37-1.fc42 fedora 8.2 MiB bzip2 x86_64 1.0.8-20.fc42 fedora 99.3 KiB coreutils x86_64 9.6-1.fc42 fedora 5.5 MiB cpio x86_64 2.15-2.fc41 fedora 1.1 MiB diffutils x86_64 3.10-9.fc42 fedora 1.6 MiB fedora-release-common noarch 42-0.21 fedora 20.1 KiB findutils x86_64 1:4.10.0-5.fc42 fedora 1.9 MiB gawk x86_64 5.3.1-1.fc42 fedora 1.7 MiB glibc-minimal-langpack x86_64 2.40.9000-35.fc42 fedora 0.0 B grep x86_64 3.11-10.fc42 fedora 1.0 MiB gzip x86_64 1.13-3.fc42 fedora 392.9 KiB info x86_64 7.2-3.fc42 fedora 357.9 KiB patch x86_64 2.7.6-26.fc42 fedora 258.7 KiB redhat-rpm-config noarch 342-2.fc42 fedora 186.8 KiB rpm-build x86_64 4.20.0-8.fc42 fedora 165.2 KiB sed x86_64 4.9-4.fc42 fedora 857.3 KiB shadow-utils x86_64 2:4.17.0-4.fc42 fedora 4.0 MiB tar x86_64 2:1.35-5.fc42 fedora 3.0 MiB unzip x86_64 6.0-66.fc42 fedora 390.3 KiB util-linux x86_64 2.40.4-7.fc42 fedora 3.4 MiB which x86_64 2.23-1.fc42 fedora 83.4 KiB xz x86_64 1:5.6.3-3.fc42 fedora 1.2 MiB Installing dependencies: add-determinism x86_64 0.6.0-1.fc42 fedora 2.5 MiB alternatives x86_64 1.31-3.fc42 fedora 66.2 KiB ansible-srpm-macros noarch 1-17.1.fc42 fedora 35.7 KiB audit-libs x86_64 4.0.3-2.fc42 fedora 351.3 KiB basesystem noarch 11-22.fc42 fedora 0.0 B binutils x86_64 2.44-3.fc42 fedora 25.9 MiB build-reproducibility-srpm-macros noarch 0.6.0-1.fc42 fedora 735.0 B bzip2-libs x86_64 1.0.8-20.fc42 fedora 84.6 KiB ca-certificates noarch 2024.2.69_v8.0.401-5.fc42 fedora 2.6 MiB coreutils-common x86_64 9.6-1.fc42 fedora 11.1 MiB crypto-policies noarch 20250214-1.gitff7551b.fc42 fedora 137.2 KiB curl x86_64 8.11.1-4.fc42 fedora 450.6 KiB cyrus-sasl-lib x86_64 2.1.28-30.fc42 fedora 2.3 MiB debugedit x86_64 5.1-4.fc42 fedora 200.4 KiB dwz x86_64 0.15-9.fc42 fedora 291.0 KiB ed x86_64 1.21-2.fc42 fedora 146.5 KiB efi-srpm-macros noarch 6-2.fc42 fedora 40.1 KiB elfutils x86_64 0.192-8.fc42 fedora 2.7 MiB elfutils-debuginfod-client x86_64 0.192-8.fc42 fedora 83.9 KiB elfutils-default-yama-scope noarch 0.192-8.fc42 fedora 1.8 KiB elfutils-libelf x86_64 0.192-8.fc42 fedora 1.2 MiB elfutils-libs x86_64 0.192-8.fc42 fedora 675.0 KiB fedora-gpg-keys noarch 42-0.5 fedora 128.2 KiB fedora-release noarch 42-0.21 fedora 0.0 B fedora-release-identity-basic noarch 42-0.21 fedora 701.0 B fedora-repos noarch 42-0.5 fedora 4.9 KiB file x86_64 5.46-1.fc42 fedora 100.2 KiB file-libs x86_64 5.46-1.fc42 fedora 11.9 MiB filesystem x86_64 3.18-36.fc42 fedora 112.0 B filesystem-srpm-macros noarch 3.18-36.fc42 fedora 38.2 KiB fonts-srpm-macros noarch 1:2.0.5-21.fc42 fedora 55.8 KiB forge-srpm-macros noarch 0.4.0-2.fc42 fedora 38.9 KiB fpc-srpm-macros noarch 1.3-14.fc42 fedora 144.0 B gdb-minimal x86_64 16.2-2.fc42 fedora 13.3 MiB gdbm-libs x86_64 1:1.23-9.fc42 fedora 129.9 KiB ghc-srpm-macros noarch 1.9.2-2.fc42 fedora 779.0 B glibc x86_64 2.40.9000-35.fc42 fedora 6.6 MiB glibc-common x86_64 2.40.9000-35.fc42 fedora 1.0 MiB glibc-gconv-extra x86_64 2.40.9000-35.fc42 fedora 7.2 MiB gmp x86_64 1:6.3.0-2.fc41 fedora 811.4 KiB gnat-srpm-macros noarch 6-7.fc42 fedora 1.0 KiB go-srpm-macros noarch 3.6.0-6.fc42 fedora 60.8 KiB jansson x86_64 2.14-2.fc42 fedora 93.1 KiB json-c x86_64 0.18-2.fc42 fedora 86.7 KiB kernel-srpm-macros noarch 1.0-25.fc42 fedora 1.9 KiB keyutils-libs x86_64 1.6.3-5.fc42 fedora 58.3 KiB krb5-libs x86_64 1.21.3-5.fc42 fedora 2.3 MiB libacl x86_64 2.3.2-3.fc42 fedora 38.3 KiB libarchive x86_64 3.7.7-2.fc42 fedora 938.6 KiB libattr x86_64 2.5.2-5.fc42 fedora 27.1 KiB libblkid x86_64 2.40.4-7.fc42 fedora 262.4 KiB libbrotli x86_64 1.1.0-6.fc42 fedora 841.3 KiB libcap x86_64 2.73-2.fc42 fedora 207.1 KiB libcap-ng x86_64 0.8.5-4.fc42 fedora 72.9 KiB libcom_err x86_64 1.47.2-3.fc42 fedora 67.1 KiB libcurl x86_64 8.11.1-4.fc42 fedora 842.1 KiB libeconf x86_64 0.7.6-1.fc42 fedora 64.6 KiB libevent x86_64 2.1.12-15.fc42 fedora 903.1 KiB libfdisk x86_64 2.40.4-7.fc42 fedora 372.3 KiB libffi x86_64 3.4.6-5.fc42 fedora 82.3 KiB libgcc x86_64 15.0.1-0.9.fc42 fedora 266.6 KiB libgomp x86_64 15.0.1-0.9.fc42 fedora 535.9 KiB libidn2 x86_64 2.3.7-3.fc42 fedora 329.0 KiB libmount x86_64 2.40.4-7.fc42 fedora 356.3 KiB libnghttp2 x86_64 1.64.0-3.fc42 fedora 170.4 KiB libpkgconf x86_64 2.3.0-2.fc42 fedora 78.1 KiB libpsl x86_64 0.21.5-5.fc42 fedora 76.4 KiB libselinux x86_64 3.8-1.fc42 fedora 193.1 KiB libsemanage x86_64 3.8-1.fc42 fedora 308.4 KiB libsepol x86_64 3.8-1.fc42 fedora 826.0 KiB libsmartcols x86_64 2.40.4-7.fc42 fedora 180.4 KiB libssh x86_64 0.11.1-4.fc42 fedora 565.5 KiB libssh-config noarch 0.11.1-4.fc42 fedora 277.0 B libstdc++ x86_64 15.0.1-0.9.fc42 fedora 2.8 MiB libtasn1 x86_64 4.20.0-1.fc42 fedora 176.3 KiB libtool-ltdl x86_64 2.5.4-4.fc42 fedora 70.1 KiB libunistring x86_64 1.1-9.fc42 fedora 1.7 MiB libuuid x86_64 2.40.4-7.fc42 fedora 37.3 KiB libverto x86_64 0.3.2-10.fc42 fedora 25.4 KiB libxcrypt x86_64 4.4.38-6.fc42 fedora 284.5 KiB libxml2 x86_64 2.12.9-2.fc42 fedora 1.7 MiB libzstd x86_64 1.5.6-3.fc42 fedora 795.8 KiB lua-libs x86_64 5.4.7-2.fc42 fedora 280.9 KiB lua-srpm-macros noarch 1-15.fc42 fedora 1.3 KiB lz4-libs x86_64 1.10.0-2.fc42 fedora 157.4 KiB mpfr x86_64 4.2.1-6.fc42 fedora 831.9 KiB ncurses-base noarch 6.5-5.20250125.fc42 fedora 326.8 KiB ncurses-libs x86_64 6.5-5.20250125.fc42 fedora 946.3 KiB ocaml-srpm-macros noarch 10-4.fc42 fedora 1.9 KiB openblas-srpm-macros noarch 2-19.fc42 fedora 112.0 B openldap x86_64 2.6.9-3.fc42 fedora 655.1 KiB openssl-libs x86_64 1:3.2.4-1.fc42 fedora 7.8 MiB p11-kit x86_64 0.25.5-5.fc42 fedora 2.2 MiB p11-kit-trust x86_64 0.25.5-5.fc42 fedora 395.5 KiB package-notes-srpm-macros noarch 0.5-13.fc42 fedora 1.6 KiB pam-libs x86_64 1.7.0-4.fc42 fedora 126.7 KiB pcre2 x86_64 10.44-1.fc42.2 fedora 649.3 KiB pcre2-syntax noarch 10.44-1.fc42.2 fedora 251.6 KiB perl-srpm-macros noarch 1-57.fc42 fedora 861.0 B pkgconf x86_64 2.3.0-2.fc42 fedora 88.5 KiB pkgconf-m4 noarch 2.3.0-2.fc42 fedora 14.4 KiB pkgconf-pkg-config x86_64 2.3.0-2.fc42 fedora 989.0 B popt x86_64 1.19-8.fc42 fedora 132.8 KiB publicsuffix-list-dafsa noarch 20250116-1.fc42 fedora 68.5 KiB pyproject-srpm-macros noarch 1.17.0-1.fc42 fedora 1.9 KiB python-srpm-macros noarch 3.13-4.fc42 fedora 51.0 KiB qt5-srpm-macros noarch 5.15.15-1.fc42 fedora 500.0 B qt6-srpm-macros noarch 6.8.2-2.fc42 fedora 464.0 B readline x86_64 8.2-12.fc42 fedora 485.0 KiB rpm x86_64 4.20.0-8.fc42 fedora 3.0 MiB rpm-build-libs x86_64 4.20.0-8.fc42 fedora 202.6 KiB rpm-libs x86_64 4.20.0-8.fc42 fedora 721.8 KiB rpm-sequoia x86_64 1.7.0-5.fc42 fedora 2.4 MiB rust-srpm-macros noarch 26.3-4.fc42 fedora 4.8 KiB setup noarch 2.15.0-12.fc42 fedora 720.8 KiB sqlite-libs x86_64 3.47.2-2.fc42 fedora 1.5 MiB systemd-libs x86_64 257.3-7.fc42 fedora 2.2 MiB systemd-standalone-sysusers x86_64 257.3-7.fc42 fedora 277.3 KiB tree-sitter-srpm-macros noarch 0.1.0-8.fc42 fedora 6.5 KiB util-linux-core x86_64 2.40.4-7.fc42 fedora 1.4 MiB xxhash-libs x86_64 0.8.3-2.fc42 fedora 90.2 KiB xz-libs x86_64 1:5.6.3-3.fc42 fedora 218.3 KiB zig-srpm-macros noarch 1-4.fc42 fedora 1.1 KiB zip x86_64 3.0-43.fc42 fedora 698.5 KiB zlib-ng-compat x86_64 2.2.3-2.fc42 fedora 137.6 KiB zstd x86_64 1.5.6-3.fc42 fedora 1.7 MiB Installing groups: Buildsystem building group Transaction Summary: Installing: 148 packages Total size of inbound packages is 52 MiB. Need to download 52 MiB. After this operation, 176 MiB extra will be used (install 176 MiB, remove 0 B). [ 1/148] bzip2-0:1.0.8-20.fc42.x86_64 100% | 3.9 MiB/s | 52.1 KiB | 00m00s [ 2/148] cpio-0:2.15-2.fc41.x86_64 100% | 95.0 MiB/s | 291.8 KiB | 00m00s [ 3/148] coreutils-0:9.6-1.fc42.x86_64 100% | 60.7 MiB/s | 1.2 MiB | 00m00s [ 4/148] bash-0:5.2.37-1.fc42.x86_64 100% | 82.2 MiB/s | 1.8 MiB | 00m00s [ 5/148] diffutils-0:3.10-9.fc42.x86_6 100% | 79.0 MiB/s | 404.6 KiB | 00m00s [ 6/148] fedora-release-common-0:42-0. 100% | 8.1 MiB/s | 24.9 KiB | 00m00s [ 7/148] findutils-1:4.10.0-5.fc42.x86 100% | 179.5 MiB/s | 551.5 KiB | 00m00s [ 8/148] glibc-minimal-langpack-0:2.40 100% | 41.6 MiB/s | 127.9 KiB | 00m00s [ 9/148] grep-0:3.11-10.fc42.x86_64 100% | 97.7 MiB/s | 300.1 KiB | 00m00s [ 10/148] gzip-0:1.13-3.fc42.x86_64 100% | 83.2 MiB/s | 170.4 KiB | 00m00s [ 11/148] info-0:7.2-3.fc42.x86_64 100% | 59.8 MiB/s | 183.8 KiB | 00m00s [ 12/148] patch-0:2.7.6-26.fc42.x86_64 100% | 62.7 MiB/s | 128.4 KiB | 00m00s [ 13/148] redhat-rpm-config-0:342-2.fc4 100% | 39.9 MiB/s | 81.6 KiB | 00m00s [ 14/148] rpm-build-0:4.20.0-8.fc42.x86 100% | 40.1 MiB/s | 82.0 KiB | 00m00s [ 15/148] sed-0:4.9-4.fc42.x86_64 100% | 103.3 MiB/s | 317.3 KiB | 00m00s [ 16/148] unzip-0:6.0-66.fc42.x86_64 100% | 90.1 MiB/s | 184.6 KiB | 00m00s [ 17/148] tar-2:1.35-5.fc42.x86_64 100% | 140.4 MiB/s | 862.5 KiB | 00m00s [ 18/148] shadow-utils-2:4.17.0-4.fc42. 100% | 163.8 MiB/s | 1.3 MiB | 00m00s [ 19/148] which-0:2.23-1.fc42.x86_64 100% | 10.2 MiB/s | 41.7 KiB | 00m00s [ 20/148] xz-1:5.6.3-3.fc42.x86_64 100% | 154.6 MiB/s | 475.0 KiB | 00m00s [ 21/148] gawk-0:5.3.1-1.fc42.x86_64 100% | 154.1 MiB/s | 1.1 MiB | 00m00s [ 22/148] util-linux-0:2.40.4-7.fc42.x8 100% | 144.3 MiB/s | 1.2 MiB | 00m00s [ 23/148] filesystem-0:3.18-36.fc42.x86 100% | 147.9 MiB/s | 1.3 MiB | 00m00s [ 24/148] ncurses-libs-0:6.5-5.20250125 100% | 109.0 MiB/s | 335.0 KiB | 00m00s [ 25/148] bzip2-libs-0:1.0.8-20.fc42.x8 100% | 10.6 MiB/s | 43.6 KiB | 00m00s [ 26/148] glibc-0:2.40.9000-35.fc42.x86 100% | 206.6 MiB/s | 2.3 MiB | 00m00s [ 27/148] gmp-1:6.3.0-2.fc41.x86_64 100% | 62.1 MiB/s | 318.0 KiB | 00m00s [ 28/148] libacl-0:2.3.2-3.fc42.x86_64 100% | 11.2 MiB/s | 23.0 KiB | 00m00s [ 29/148] coreutils-common-0:9.6-1.fc42 100% | 176.7 MiB/s | 2.1 MiB | 00m00s [ 30/148] libattr-0:2.5.2-5.fc42.x86_64 100% | 5.6 MiB/s | 17.1 KiB | 00m00s [ 31/148] libcap-0:2.73-2.fc42.x86_64 100% | 27.4 MiB/s | 84.3 KiB | 00m00s [ 32/148] libselinux-0:3.8-1.fc42.x86_6 100% | 47.4 MiB/s | 97.1 KiB | 00m00s [ 33/148] systemd-libs-0:257.3-7.fc42.x 100% | 200.1 MiB/s | 819.6 KiB | 00m00s [ 34/148] fedora-repos-0:42-0.5.noarch 100% | 4.6 MiB/s | 9.4 KiB | 00m00s [ 35/148] openssl-libs-1:3.2.4-1.fc42.x 100% | 234.6 MiB/s | 2.3 MiB | 00m00s [ 36/148] pcre2-0:10.44-1.fc42.2.x86_64 100% | 47.5 MiB/s | 243.4 KiB | 00m00s [ 37/148] glibc-common-0:2.40.9000-35.f 100% | 67.6 MiB/s | 415.4 KiB | 00m00s [ 38/148] ansible-srpm-macros-0:1-17.1. 100% | 19.8 MiB/s | 20.3 KiB | 00m00s [ 39/148] ed-0:1.21-2.fc42.x86_64 100% | 40.0 MiB/s | 82.0 KiB | 00m00s [ 40/148] build-reproducibility-srpm-ma 100% | 5.7 MiB/s | 11.7 KiB | 00m00s [ 41/148] dwz-0:0.15-9.fc42.x86_64 100% | 66.3 MiB/s | 135.7 KiB | 00m00s [ 42/148] efi-srpm-macros-0:6-2.fc42.no 100% | 11.0 MiB/s | 22.5 KiB | 00m00s [ 43/148] file-0:5.46-1.fc42.x86_64 100% | 23.8 MiB/s | 48.7 KiB | 00m00s [ 44/148] filesystem-srpm-macros-0:3.18 100% | 25.0 MiB/s | 25.6 KiB | 00m00s [ 45/148] fonts-srpm-macros-1:2.0.5-21. 100% | 26.5 MiB/s | 27.1 KiB | 00m00s [ 46/148] forge-srpm-macros-0:0.4.0-2.f 100% | 9.7 MiB/s | 19.9 KiB | 00m00s [ 47/148] fpc-srpm-macros-0:1.3-14.fc42 100% | 7.8 MiB/s | 8.0 KiB | 00m00s [ 48/148] ghc-srpm-macros-0:1.9.2-2.fc4 100% | 8.9 MiB/s | 9.2 KiB | 00m00s [ 49/148] gnat-srpm-macros-0:6-7.fc42.n 100% | 8.4 MiB/s | 8.6 KiB | 00m00s [ 50/148] kernel-srpm-macros-0:1.0-25.f 100% | 9.6 MiB/s | 9.9 KiB | 00m00s [ 51/148] go-srpm-macros-0:3.6.0-6.fc42 100% | 27.0 MiB/s | 27.7 KiB | 00m00s [ 52/148] lua-srpm-macros-0:1-15.fc42.n 100% | 8.7 MiB/s | 8.9 KiB | 00m00s [ 53/148] ocaml-srpm-macros-0:10-4.fc42 100% | 9.0 MiB/s | 9.2 KiB | 00m00s [ 54/148] openblas-srpm-macros-0:2-19.f 100% | 3.8 MiB/s | 7.8 KiB | 00m00s [ 55/148] package-notes-srpm-macros-0:0 100% | 9.0 MiB/s | 9.3 KiB | 00m00s [ 56/148] perl-srpm-macros-0:1-57.fc42. 100% | 8.3 MiB/s | 8.5 KiB | 00m00s [ 57/148] pyproject-srpm-macros-0:1.17. 100% | 13.6 MiB/s | 13.9 KiB | 00m00s [ 58/148] qt5-srpm-macros-0:5.15.15-1.f 100% | 8.7 MiB/s | 8.9 KiB | 00m00s [ 59/148] python-srpm-macros-0:3.13-4.f 100% | 22.4 MiB/s | 23.0 KiB | 00m00s [ 60/148] qt6-srpm-macros-0:6.8.2-2.fc4 100% | 9.1 MiB/s | 9.3 KiB | 00m00s [ 61/148] rust-srpm-macros-0:26.3-4.fc4 100% | 11.4 MiB/s | 11.7 KiB | 00m00s [ 62/148] rpm-0:4.20.0-8.fc42.x86_64 100% | 177.7 MiB/s | 545.8 KiB | 00m00s [ 63/148] tree-sitter-srpm-macros-0:0.1 100% | 5.5 MiB/s | 11.2 KiB | 00m00s [ 64/148] zig-srpm-macros-0:1-4.fc42.no 100% | 4.0 MiB/s | 8.2 KiB | 00m00s [ 65/148] debugedit-0:5.1-4.fc42.x86_64 100% | 38.5 MiB/s | 78.9 KiB | 00m00s [ 66/148] zip-0:3.0-43.fc42.x86_64 100% | 85.8 MiB/s | 263.5 KiB | 00m00s [ 67/148] elfutils-0:0.192-8.fc42.x86_6 100% | 179.4 MiB/s | 551.0 KiB | 00m00s [ 68/148] elfutils-libelf-0:0.192-8.fc4 100% | 67.7 MiB/s | 208.1 KiB | 00m00s [ 69/148] libarchive-0:3.7.7-2.fc42.x86 100% | 135.7 MiB/s | 416.9 KiB | 00m00s [ 70/148] popt-0:1.19-8.fc42.x86_64 100% | 32.2 MiB/s | 65.9 KiB | 00m00s [ 71/148] readline-0:8.2-12.fc42.x86_64 100% | 105.1 MiB/s | 215.2 KiB | 00m00s [ 72/148] rpm-build-libs-0:4.20.0-8.fc4 100% | 48.1 MiB/s | 98.6 KiB | 00m00s [ 73/148] rpm-libs-0:4.20.0-8.fc42.x86_ 100% | 153.2 MiB/s | 313.7 KiB | 00m00s [ 74/148] zstd-0:1.5.6-3.fc42.x86_64 100% | 117.1 MiB/s | 479.6 KiB | 00m00s [ 75/148] libeconf-0:0.7.6-1.fc42.x86_6 100% | 17.2 MiB/s | 35.2 KiB | 00m00s [ 76/148] audit-libs-0:4.0.3-2.fc42.x86 100% | 40.8 MiB/s | 125.3 KiB | 00m00s [ 77/148] pam-libs-0:1.7.0-4.fc42.x86_6 100% | 57.0 MiB/s | 58.3 KiB | 00m00s [ 78/148] libsemanage-0:3.8-1.fc42.x86_ 100% | 60.3 MiB/s | 123.6 KiB | 00m00s [ 79/148] libxcrypt-0:4.4.38-6.fc42.x86 100% | 62.2 MiB/s | 127.3 KiB | 00m00s [ 80/148] xz-libs-1:5.6.3-3.fc42.x86_64 100% | 55.4 MiB/s | 113.4 KiB | 00m00s [ 81/148] setup-0:2.15.0-12.fc42.noarch 100% | 50.7 MiB/s | 155.7 KiB | 00m00s [ 82/148] mpfr-0:4.2.1-6.fc42.x86_64 100% | 113.4 MiB/s | 348.5 KiB | 00m00s [ 83/148] libblkid-0:2.40.4-7.fc42.x86_ 100% | 119.7 MiB/s | 122.5 KiB | 00m00s [ 84/148] libcap-ng-0:0.8.5-4.fc42.x86_ 100% | 31.4 MiB/s | 32.2 KiB | 00m00s [ 85/148] libfdisk-0:2.40.4-7.fc42.x86_ 100% | 77.4 MiB/s | 158.5 KiB | 00m00s [ 86/148] libmount-0:2.40.4-7.fc42.x86_ 100% | 75.7 MiB/s | 155.1 KiB | 00m00s [ 87/148] libsmartcols-0:2.40.4-7.fc42. 100% | 39.7 MiB/s | 81.2 KiB | 00m00s [ 88/148] libuuid-0:2.40.4-7.fc42.x86_6 100% | 24.7 MiB/s | 25.3 KiB | 00m00s [ 89/148] zlib-ng-compat-0:2.2.3-2.fc42 100% | 38.5 MiB/s | 78.9 KiB | 00m00s [ 90/148] util-linux-core-0:2.40.4-7.fc 100% | 172.3 MiB/s | 529.2 KiB | 00m00s [ 91/148] basesystem-0:11-22.fc42.noarc 100% | 7.1 MiB/s | 7.3 KiB | 00m00s [ 92/148] ncurses-base-0:6.5-5.20250125 100% | 43.0 MiB/s | 88.1 KiB | 00m00s [ 93/148] libsepol-0:3.8-1.fc42.x86_64 100% | 113.6 MiB/s | 348.9 KiB | 00m00s [ 94/148] glibc-gconv-extra-0:2.40.9000 100% | 208.9 MiB/s | 1.7 MiB | 00m00s [ 95/148] crypto-policies-0:20250214-1. 100% | 32.1 MiB/s | 98.7 KiB | 00m00s [ 96/148] fedora-gpg-keys-0:42-0.5.noar 100% | 44.2 MiB/s | 135.7 KiB | 00m00s [ 97/148] ca-certificates-0:2024.2.69_v 100% | 132.4 MiB/s | 949.0 KiB | 00m00s [ 98/148] pcre2-syntax-0:10.44-1.fc42.2 100% | 48.7 MiB/s | 149.8 KiB | 00m00s [ 99/148] curl-0:8.11.1-4.fc42.x86_64 100% | 72.4 MiB/s | 222.4 KiB | 00m00s [100/148] add-determinism-0:0.6.0-1.fc4 100% | 149.5 MiB/s | 918.3 KiB | 00m00s [101/148] file-libs-0:5.46-1.fc42.x86_6 100% | 138.2 MiB/s | 849.4 KiB | 00m00s [102/148] elfutils-libs-0:0.192-8.fc42. 100% | 86.5 MiB/s | 265.9 KiB | 00m00s [103/148] elfutils-debuginfod-client-0: 100% | 22.7 MiB/s | 46.5 KiB | 00m00s [104/148] libzstd-0:1.5.6-3.fc42.x86_64 100% | 151.9 MiB/s | 311.1 KiB | 00m00s [105/148] lz4-libs-0:1.10.0-2.fc42.x86_ 100% | 38.1 MiB/s | 78.1 KiB | 00m00s [106/148] libxml2-0:2.12.9-2.fc42.x86_6 100% | 226.6 MiB/s | 696.0 KiB | 00m00s [107/148] lua-libs-0:5.4.7-2.fc42.x86_6 100% | 64.9 MiB/s | 133.0 KiB | 00m00s [108/148] elfutils-default-yama-scope-0 100% | 12.3 MiB/s | 12.6 KiB | 00m00s [109/148] sqlite-libs-0:3.47.2-2.fc42.x 100% | 179.4 MiB/s | 734.8 KiB | 00m00s [110/148] rpm-sequoia-0:1.7.0-5.fc42.x8 100% | 127.1 MiB/s | 911.1 KiB | 00m00s [111/148] json-c-0:0.18-2.fc42.x86_64 100% | 14.6 MiB/s | 44.9 KiB | 00m00s [112/148] libgcc-0:15.0.1-0.9.fc42.x86_ 100% | 57.6 MiB/s | 118.0 KiB | 00m00s [113/148] libgomp-0:15.0.1-0.9.fc42.x86 100% | 85.5 MiB/s | 350.3 KiB | 00m00s [114/148] libstdc++-0:15.0.1-0.9.fc42.x 100% | 144.7 MiB/s | 888.8 KiB | 00m00s [115/148] alternatives-0:1.31-3.fc42.x8 100% | 20.0 MiB/s | 40.9 KiB | 00m00s [116/148] jansson-0:2.14-2.fc42.x86_64 100% | 22.3 MiB/s | 45.7 KiB | 00m00s [117/148] pkgconf-pkg-config-0:2.3.0-2. 100% | 4.8 MiB/s | 9.9 KiB | 00m00s [118/148] pkgconf-0:2.3.0-2.fc42.x86_64 100% | 21.9 MiB/s | 44.9 KiB | 00m00s [119/148] pkgconf-m4-0:2.3.0-2.fc42.noa 100% | 7.0 MiB/s | 14.2 KiB | 00m00s [120/148] libffi-0:3.4.6-5.fc42.x86_64 100% | 19.5 MiB/s | 39.9 KiB | 00m00s [121/148] libpkgconf-0:2.3.0-2.fc42.x86 100% | 12.5 MiB/s | 38.4 KiB | 00m00s [122/148] binutils-0:2.44-3.fc42.x86_64 100% | 277.0 MiB/s | 5.8 MiB | 00m00s [123/148] libtasn1-0:4.20.0-1.fc42.x86_ 100% | 9.2 MiB/s | 75.0 KiB | 00m00s [124/148] p11-kit-0:0.25.5-5.fc42.x86_6 100% | 53.4 MiB/s | 491.7 KiB | 00m00s [125/148] fedora-release-0:42-0.21.noar 100% | 13.7 MiB/s | 14.0 KiB | 00m00s [126/148] p11-kit-trust-0:0.25.5-5.fc42 100% | 64.7 MiB/s | 132.6 KiB | 00m00s [127/148] xxhash-libs-0:0.8.3-2.fc42.x8 100% | 38.2 MiB/s | 39.1 KiB | 00m00s [128/148] systemd-standalone-sysusers-0 100% | 76.8 MiB/s | 157.4 KiB | 00m00s [129/148] fedora-release-identity-basic 100% | 7.2 MiB/s | 14.8 KiB | 00m00s [130/148] libcurl-0:8.11.1-4.fc42.x86_6 100% | 92.0 MiB/s | 376.9 KiB | 00m00s [131/148] krb5-libs-0:1.21.3-5.fc42.x86 100% | 106.7 MiB/s | 764.7 KiB | 00m00s [132/148] libbrotli-0:1.1.0-6.fc42.x86_ 100% | 66.4 MiB/s | 339.8 KiB | 00m00s [133/148] libidn2-0:2.3.7-3.fc42.x86_64 100% | 57.6 MiB/s | 118.0 KiB | 00m00s [134/148] gdb-minimal-0:16.2-2.fc42.x86 100% | 233.4 MiB/s | 4.4 MiB | 00m00s [135/148] libnghttp2-0:1.64.0-3.fc42.x8 100% | 12.6 MiB/s | 77.7 KiB | 00m00s [136/148] libpsl-0:0.21.5-5.fc42.x86_64 100% | 12.5 MiB/s | 64.0 KiB | 00m00s [137/148] keyutils-libs-0:1.6.3-5.fc42. 100% | 15.4 MiB/s | 31.5 KiB | 00m00s [138/148] openldap-0:2.6.9-3.fc42.x86_6 100% | 63.5 MiB/s | 260.2 KiB | 00m00s [139/148] libssh-0:0.11.1-4.fc42.x86_64 100% | 45.6 MiB/s | 233.3 KiB | 00m00s [140/148] libcom_err-0:1.47.2-3.fc42.x8 100% | 13.1 MiB/s | 26.9 KiB | 00m00s [141/148] libverto-0:0.3.2-10.fc42.x86_ 100% | 20.3 MiB/s | 20.8 KiB | 00m00s [142/148] libssh-config-0:0.11.1-4.fc42 100% | 8.8 MiB/s | 9.0 KiB | 00m00s [143/148] publicsuffix-list-dafsa-0:202 100% | 28.7 MiB/s | 58.8 KiB | 00m00s [144/148] libunistring-0:1.1-9.fc42.x86 100% | 176.6 MiB/s | 542.5 KiB | 00m00s [145/148] libtool-ltdl-0:2.5.4-4.fc42.x 100% | 17.7 MiB/s | 36.2 KiB | 00m00s [146/148] libevent-0:2.1.12-15.fc42.x86 100% | 84.7 MiB/s | 260.2 KiB | 00m00s [147/148] cyrus-sasl-lib-0:2.1.28-30.fc 100% | 129.1 MiB/s | 793.5 KiB | 00m00s [148/148] gdbm-libs-1:1.23-9.fc42.x86_6 100% | 18.6 MiB/s | 57.0 KiB | 00m00s -------------------------------------------------------------------------------- [148/148] Total 100% | 102.4 MiB/s | 52.2 MiB | 00m01s Running transaction Importing OpenPGP key 0x105EF944: UserID : "Fedora (42) " Fingerprint: B0F4950458F69E1150C6C5EDC8AC4916105EF944 From : file:///usr/share/distribution-gpg-keys/fedora/RPM-GPG-KEY-fedora-42-primary The key was successfully imported. [ 1/150] Verify package files 100% | 822.0 B/s | 148.0 B | 00m00s [ 2/150] Prepare transaction 100% | 4.0 KiB/s | 148.0 B | 00m00s [ 3/150] Installing libgcc-0:15.0.1-0. 100% | 262.0 MiB/s | 268.3 KiB | 00m00s [ 4/150] Installing libssh-config-0:0. 100% | 0.0 B/s | 816.0 B | 00m00s [ 5/150] Installing publicsuffix-list- 100% | 0.0 B/s | 69.2 KiB | 00m00s [ 6/150] Installing fedora-release-ide 100% | 0.0 B/s | 960.0 B | 00m00s [ 7/150] Installing fedora-gpg-keys-0: 100% | 42.7 MiB/s | 174.8 KiB | 00m00s [ 8/150] Installing fedora-repos-0:42- 100% | 0.0 B/s | 5.7 KiB | 00m00s [ 9/150] Installing fedora-release-com 100% | 23.8 MiB/s | 24.4 KiB | 00m00s [ 10/150] Installing fedora-release-0:4 100% | 10.1 KiB/s | 124.0 B | 00m00s >>> Running unknown scriptlet: setup-0:2.15.0-12.fc42.noarch >>> Finished unknown scriptlet: setup-0:2.15.0-12.fc42.noarch >>> Scriptlet output: >>> Creating group 'adm' with GID 4. >>> Creating group 'audio' with GID 63. >>> Creating group 'bin' with GID 1. >>> Creating group 'cdrom' with GID 11. >>> Creating group 'clock' with GID 103. >>> Creating group 'daemon' with GID 2. >>> Creating group 'dialout' with GID 18. >>> Creating group 'disk' with GID 6. >>> Creating group 'floppy' with GID 19. >>> Creating group 'ftp' with GID 50. >>> Creating group 'games' with GID 20. >>> Creating group 'input' with GID 104. >>> Creating group 'kmem' with GID 9. >>> Creating group 'kvm' with GID 36. >>> Creating group 'lock' with GID 54. >>> Creating group 'lp' with GID 7. >>> Creating group 'mail' with GID 12. >>> Creating group 'man' with GID 15. >>> Creating group 'mem' with GID 8. >>> Creating group 'nobody' with GID 65534. >>> Creating group 'render' with GID 105. >>> Creating group 'root' with GID 0. >>> Creating group 'sgx' with GID 106. >>> Creating group 'sys' with GID 3. >>> Creating group 'tape' with GID 33. >>> Creating group 'tty' with GID 5. >>> Creating group 'users' with GID 100. >>> Creating group 'utmp' with GID 22. >>> Creating group 'video' with GID 39. >>> Creating group 'wheel' with GID 10. >>> >>> Running unknown scriptlet: setup-0:2.15.0-12.fc42.noarch >>> Finished unknown scriptlet: setup-0:2.15.0-12.fc42.noarch >>> Scriptlet output: >>> Creating user 'adm' (adm) with UID 3 and GID 4. >>> Creating user 'bin' (bin) with UID 1 and GID 1. >>> Creating user 'daemon' (daemon) with UID 2 and GID 2. >>> Creating user 'ftp' (FTP User) with UID 14 and GID 50. >>> Creating user 'games' (games) with UID 12 and GID 20. >>> Creating user 'halt' (halt) with UID 7 and GID 0. >>> Creating user 'lp' (lp) with UID 4 and GID 7. >>> Creating user 'mail' (mail) with UID 8 and GID 12. >>> Creating user 'nobody' (Kernel Overflow User) with UID 65534 and GID 65534. >>> Creating user 'operator' (operator) with UID 11 and GID 0. >>> Creating user 'root' (Super User) with UID 0 and GID 0. >>> Creating user 'shutdown' (shutdown) with UID 6 and GID 0. >>> Creating user 'sync' (sync) with UID 5 and GID 0. >>> [ 11/150] Installing setup-0:2.15.0-12. 100% | 47.3 MiB/s | 726.6 KiB | 00m00s >>> [RPM] /etc/hosts created as /etc/hosts.rpmnew [ 12/150] Installing filesystem-0:3.18- 100% | 2.6 MiB/s | 212.4 KiB | 00m00s [ 13/150] Installing basesystem-0:11-22 100% | 0.0 B/s | 124.0 B | 00m00s [ 14/150] Installing pkgconf-m4-0:2.3.0 100% | 0.0 B/s | 14.8 KiB | 00m00s [ 15/150] Installing pcre2-syntax-0:10. 100% | 248.1 MiB/s | 254.1 KiB | 00m00s [ 16/150] Installing ncurses-base-0:6.5 100% | 86.0 MiB/s | 352.2 KiB | 00m00s [ 17/150] Installing glibc-minimal-lang 100% | 0.0 B/s | 124.0 B | 00m00s [ 18/150] Installing ncurses-libs-0:6.5 100% | 232.6 MiB/s | 952.8 KiB | 00m00s [ 19/150] Installing glibc-0:2.40.9000- 100% | 195.7 MiB/s | 6.7 MiB | 00m00s [ 20/150] Installing bash-0:5.2.37-1.fc 100% | 255.3 MiB/s | 8.2 MiB | 00m00s [ 21/150] Installing glibc-common-0:2.4 100% | 60.0 MiB/s | 1.0 MiB | 00m00s [ 22/150] Installing glibc-gconv-extra- 100% | 252.1 MiB/s | 7.3 MiB | 00m00s [ 23/150] Installing zlib-ng-compat-0:2 100% | 135.2 MiB/s | 138.4 KiB | 00m00s [ 24/150] Installing bzip2-libs-0:1.0.8 100% | 83.7 MiB/s | 85.7 KiB | 00m00s [ 25/150] Installing xz-libs-1:5.6.3-3. 100% | 214.3 MiB/s | 219.4 KiB | 00m00s [ 26/150] Installing libuuid-0:2.40.4-7 100% | 0.0 B/s | 38.4 KiB | 00m00s [ 27/150] Installing libblkid-0:2.40.4- 100% | 257.4 MiB/s | 263.5 KiB | 00m00s [ 28/150] Installing gmp-1:6.3.0-2.fc41 100% | 397.3 MiB/s | 813.7 KiB | 00m00s [ 29/150] Installing popt-0:1.19-8.fc42 100% | 68.1 MiB/s | 139.4 KiB | 00m00s [ 30/150] Installing readline-0:8.2-12. 100% | 237.9 MiB/s | 487.1 KiB | 00m00s [ 31/150] Installing libxcrypt-0:4.4.38 100% | 280.4 MiB/s | 287.2 KiB | 00m00s [ 32/150] Installing libzstd-0:1.5.6-3. 100% | 389.2 MiB/s | 797.0 KiB | 00m00s [ 33/150] Installing elfutils-libelf-0: 100% | 390.1 MiB/s | 1.2 MiB | 00m00s [ 34/150] Installing libstdc++-0:15.0.1 100% | 401.2 MiB/s | 2.8 MiB | 00m00s [ 35/150] Installing libattr-0:2.5.2-5. 100% | 0.0 B/s | 28.1 KiB | 00m00s [ 36/150] Installing libacl-0:2.3.2-3.f 100% | 0.0 B/s | 39.2 KiB | 00m00s [ 37/150] Installing dwz-0:0.15-9.fc42. 100% | 22.0 MiB/s | 292.4 KiB | 00m00s [ 38/150] Installing mpfr-0:4.2.1-6.fc4 100% | 271.3 MiB/s | 833.6 KiB | 00m00s [ 39/150] Installing gawk-0:5.3.1-1.fc4 100% | 94.2 MiB/s | 1.7 MiB | 00m00s [ 40/150] Installing unzip-0:6.0-66.fc4 100% | 29.6 MiB/s | 393.8 KiB | 00m00s [ 41/150] Installing file-libs-0:5.46-1 100% | 697.5 MiB/s | 11.9 MiB | 00m00s [ 42/150] Installing file-0:5.46-1.fc42 100% | 5.2 MiB/s | 101.7 KiB | 00m00s [ 43/150] Installing crypto-policies-0: 100% | 31.9 MiB/s | 163.5 KiB | 00m00s [ 44/150] Installing pcre2-0:10.44-1.fc 100% | 317.7 MiB/s | 650.7 KiB | 00m00s [ 45/150] Installing grep-0:3.11-10.fc4 100% | 55.7 MiB/s | 1.0 MiB | 00m00s [ 46/150] Installing xz-1:5.6.3-3.fc42. 100% | 72.3 MiB/s | 1.2 MiB | 00m00s [ 47/150] Installing libeconf-0:0.7.6-1 100% | 64.7 MiB/s | 66.2 KiB | 00m00s [ 48/150] Installing libcap-ng-0:0.8.5- 100% | 73.1 MiB/s | 74.8 KiB | 00m00s [ 49/150] Installing audit-libs-0:4.0.3 100% | 172.6 MiB/s | 353.4 KiB | 00m00s [ 50/150] Installing pam-libs-0:1.7.0-4 100% | 126.1 MiB/s | 129.1 KiB | 00m00s [ 51/150] Installing libcap-0:2.73-2.fc 100% | 14.8 MiB/s | 212.1 KiB | 00m00s [ 52/150] Installing systemd-libs-0:257 100% | 320.5 MiB/s | 2.2 MiB | 00m00s [ 53/150] Installing libsmartcols-0:2.4 100% | 177.3 MiB/s | 181.5 KiB | 00m00s [ 54/150] Installing libsepol-0:3.8-1.f 100% | 403.8 MiB/s | 827.0 KiB | 00m00s [ 55/150] Installing libselinux-0:3.8-1 100% | 189.8 MiB/s | 194.3 KiB | 00m00s [ 56/150] Installing findutils-1:4.10.0 100% | 104.1 MiB/s | 1.9 MiB | 00m00s [ 57/150] Installing sed-0:4.9-4.fc42.x 100% | 52.8 MiB/s | 865.5 KiB | 00m00s [ 58/150] Installing libmount-0:2.40.4- 100% | 348.9 MiB/s | 357.3 KiB | 00m00s [ 59/150] Installing lz4-libs-0:1.10.0- 100% | 154.7 MiB/s | 158.5 KiB | 00m00s [ 60/150] Installing lua-libs-0:5.4.7-2 100% | 275.5 MiB/s | 282.1 KiB | 00m00s [ 61/150] Installing alternatives-0:1.3 100% | 5.1 MiB/s | 67.7 KiB | 00m00s [ 62/150] Installing libffi-0:3.4.6-5.f 100% | 81.7 MiB/s | 83.7 KiB | 00m00s [ 63/150] Installing libtasn1-0:4.20.0- 100% | 173.9 MiB/s | 178.1 KiB | 00m00s [ 64/150] Installing p11-kit-0:0.25.5-5 100% | 109.2 MiB/s | 2.2 MiB | 00m00s [ 65/150] Installing libunistring-0:1.1 100% | 345.3 MiB/s | 1.7 MiB | 00m00s [ 66/150] Installing libidn2-0:2.3.7-3. 100% | 163.6 MiB/s | 335.0 KiB | 00m00s [ 67/150] Installing libpsl-0:0.21.5-5. 100% | 75.7 MiB/s | 77.5 KiB | 00m00s [ 68/150] Installing p11-kit-trust-0:0. 100% | 19.4 MiB/s | 397.2 KiB | 00m00s [ 69/150] Installing zstd-0:1.5.6-3.fc4 100% | 99.7 MiB/s | 1.7 MiB | 00m00s [ 70/150] Installing util-linux-core-0: 100% | 79.2 MiB/s | 1.4 MiB | 00m00s [ 71/150] Installing tar-2:1.35-5.fc42. 100% | 148.1 MiB/s | 3.0 MiB | 00m00s [ 72/150] Installing libsemanage-0:3.8- 100% | 151.5 MiB/s | 310.2 KiB | 00m00s [ 73/150] Installing shadow-utils-2:4.1 100% | 138.6 MiB/s | 4.0 MiB | 00m00s [ 74/150] Installing systemd-standalone 100% | 20.9 MiB/s | 277.8 KiB | 00m00s [ 75/150] Installing zip-0:3.0-43.fc42. 100% | 45.7 MiB/s | 702.4 KiB | 00m00s [ 76/150] Installing libfdisk-0:2.40.4- 100% | 364.7 MiB/s | 373.4 KiB | 00m00s [ 77/150] Installing libxml2-0:2.12.9-2 100% | 100.5 MiB/s | 1.7 MiB | 00m00s [ 78/150] Installing bzip2-0:1.0.8-20.f 100% | 7.8 MiB/s | 103.8 KiB | 00m00s [ 79/150] Installing add-determinism-0: 100% | 129.8 MiB/s | 2.5 MiB | 00m00s [ 80/150] Installing build-reproducibil 100% | 0.0 B/s | 1.0 KiB | 00m00s [ 81/150] Installing sqlite-libs-0:3.47 100% | 376.1 MiB/s | 1.5 MiB | 00m00s [ 82/150] Installing ed-0:1.21-2.fc42.x 100% | 11.2 MiB/s | 148.8 KiB | 00m00s [ 83/150] Installing patch-0:2.7.6-26.f 100% | 19.5 MiB/s | 260.2 KiB | 00m00s [ 84/150] Installing filesystem-srpm-ma 100% | 0.0 B/s | 38.9 KiB | 00m00s [ 85/150] Installing elfutils-default-y 100% | 408.6 KiB/s | 2.0 KiB | 00m00s [ 86/150] Installing elfutils-libs-0:0. 100% | 220.3 MiB/s | 676.7 KiB | 00m00s [ 87/150] Installing cpio-0:2.15-2.fc41 100% | 64.7 MiB/s | 1.1 MiB | 00m00s [ 88/150] Installing diffutils-0:3.10-9 100% | 88.3 MiB/s | 1.6 MiB | 00m00s [ 89/150] Installing json-c-0:0.18-2.fc 100% | 85.9 MiB/s | 88.0 KiB | 00m00s [ 90/150] Installing libgomp-0:15.0.1-0 100% | 262.4 MiB/s | 537.3 KiB | 00m00s [ 91/150] Installing jansson-0:2.14-2.f 100% | 92.2 MiB/s | 94.4 KiB | 00m00s [ 92/150] Installing libpkgconf-0:2.3.0 100% | 0.0 B/s | 79.2 KiB | 00m00s [ 93/150] Installing pkgconf-0:2.3.0-2. 100% | 6.8 MiB/s | 91.0 KiB | 00m00s [ 94/150] Installing pkgconf-pkg-config 100% | 147.8 KiB/s | 1.8 KiB | 00m00s [ 95/150] Installing xxhash-libs-0:0.8. 100% | 89.4 MiB/s | 91.6 KiB | 00m00s [ 96/150] Installing libbrotli-0:1.1.0- 100% | 274.6 MiB/s | 843.6 KiB | 00m00s [ 97/150] Installing libnghttp2-0:1.64. 100% | 167.5 MiB/s | 171.5 KiB | 00m00s [ 98/150] Installing keyutils-libs-0:1. 100% | 58.3 MiB/s | 59.7 KiB | 00m00s [ 99/150] Installing libcom_err-0:1.47. 100% | 0.0 B/s | 68.2 KiB | 00m00s [100/150] Installing libverto-0:0.3.2-1 100% | 0.0 B/s | 27.2 KiB | 00m00s [101/150] Installing libtool-ltdl-0:2.5 100% | 0.0 B/s | 71.2 KiB | 00m00s [102/150] Installing gdbm-libs-1:1.23-9 100% | 128.5 MiB/s | 131.6 KiB | 00m00s [103/150] Installing cyrus-sasl-lib-0:2 100% | 121.3 MiB/s | 2.3 MiB | 00m00s [104/150] Installing rust-srpm-macros-0 100% | 0.0 B/s | 5.6 KiB | 00m00s [105/150] Installing qt6-srpm-macros-0: 100% | 0.0 B/s | 740.0 B | 00m00s [106/150] Installing qt5-srpm-macros-0: 100% | 0.0 B/s | 776.0 B | 00m00s [107/150] Installing perl-srpm-macros-0 100% | 0.0 B/s | 1.1 KiB | 00m00s [108/150] Installing package-notes-srpm 100% | 0.0 B/s | 2.0 KiB | 00m00s [109/150] Installing openblas-srpm-macr 100% | 0.0 B/s | 392.0 B | 00m00s [110/150] Installing ocaml-srpm-macros- 100% | 0.0 B/s | 2.2 KiB | 00m00s [111/150] Installing kernel-srpm-macros 100% | 0.0 B/s | 2.3 KiB | 00m00s [112/150] Installing gnat-srpm-macros-0 100% | 0.0 B/s | 1.3 KiB | 00m00s [113/150] Installing ghc-srpm-macros-0: 100% | 0.0 B/s | 1.0 KiB | 00m00s [114/150] Installing fpc-srpm-macros-0: 100% | 0.0 B/s | 420.0 B | 00m00s [115/150] Installing ansible-srpm-macro 100% | 35.4 MiB/s | 36.2 KiB | 00m00s [116/150] Installing coreutils-common-0 100% | 398.3 MiB/s | 11.2 MiB | 00m00s [117/150] Installing openssl-libs-1:3.2 100% | 412.4 MiB/s | 7.8 MiB | 00m00s [118/150] Installing coreutils-0:9.6-1. 100% | 168.1 MiB/s | 5.5 MiB | 00m00s [119/150] Installing ca-certificates-0: 100% | 2.0 MiB/s | 2.4 MiB | 00m01s [120/150] Installing libarchive-0:3.7.7 100% | 229.6 MiB/s | 940.6 KiB | 00m00s [121/150] Installing krb5-libs-0:1.21.3 100% | 287.5 MiB/s | 2.3 MiB | 00m00s [122/150] Installing libssh-0:0.11.1-4. 100% | 277.1 MiB/s | 567.5 KiB | 00m00s [123/150] Installing gzip-0:1.13-3.fc42 100% | 25.9 MiB/s | 398.4 KiB | 00m00s [124/150] Installing rpm-sequoia-0:1.7. 100% | 402.4 MiB/s | 2.4 MiB | 00m00s [125/150] Installing rpm-libs-0:4.20.0- 100% | 353.2 MiB/s | 723.3 KiB | 00m00s [126/150] Installing rpm-build-libs-0:4 100% | 198.7 MiB/s | 203.4 KiB | 00m00s [127/150] Installing libevent-0:2.1.12- 100% | 295.2 MiB/s | 906.9 KiB | 00m00s [128/150] Installing openldap-0:2.6.9-3 100% | 214.5 MiB/s | 658.9 KiB | 00m00s [129/150] Installing libcurl-0:8.11.1-4 100% | 274.5 MiB/s | 843.2 KiB | 00m00s [130/150] Installing elfutils-debuginfo 100% | 6.5 MiB/s | 86.2 KiB | 00m00s [131/150] Installing elfutils-0:0.192-8 100% | 134.4 MiB/s | 2.7 MiB | 00m00s [132/150] Installing binutils-0:2.44-3. 100% | 332.1 MiB/s | 25.9 MiB | 00m00s [133/150] Installing gdb-minimal-0:16.2 100% | 302.2 MiB/s | 13.3 MiB | 00m00s [134/150] Installing debugedit-0:5.1-4. 100% | 15.3 MiB/s | 203.1 KiB | 00m00s [135/150] Installing curl-0:8.11.1-4.fc 100% | 21.1 MiB/s | 453.1 KiB | 00m00s [136/150] Installing rpm-0:4.20.0-8.fc4 100% | 95.7 MiB/s | 2.5 MiB | 00m00s [137/150] Installing efi-srpm-macros-0: 100% | 0.0 B/s | 41.1 KiB | 00m00s [138/150] Installing lua-srpm-macros-0: 100% | 0.0 B/s | 1.9 KiB | 00m00s [139/150] Installing tree-sitter-srpm-m 100% | 0.0 B/s | 7.4 KiB | 00m00s [140/150] Installing zig-srpm-macros-0: 100% | 0.0 B/s | 1.7 KiB | 00m00s [141/150] Installing fonts-srpm-macros- 100% | 0.0 B/s | 57.0 KiB | 00m00s [142/150] Installing forge-srpm-macros- 100% | 0.0 B/s | 40.3 KiB | 00m00s [143/150] Installing go-srpm-macros-0:3 100% | 60.5 MiB/s | 62.0 KiB | 00m00s [144/150] Installing python-srpm-macros 100% | 50.9 MiB/s | 52.2 KiB | 00m00s [145/150] Installing redhat-rpm-config- 100% | 94.5 MiB/s | 193.5 KiB | 00m00s [146/150] Installing rpm-build-0:4.20.0 100% | 12.1 MiB/s | 173.7 KiB | 00m00s [147/150] Installing pyproject-srpm-mac 100% | 0.0 B/s | 2.5 KiB | 00m00s [148/150] Installing which-0:2.23-1.fc4 100% | 6.0 MiB/s | 85.6 KiB | 00m00s [149/150] Installing util-linux-0:2.40. 100% | 108.2 MiB/s | 3.5 MiB | 00m00s [150/150] Installing info-0:7.2-3.fc42. 100% | 232.9 KiB/s | 358.3 KiB | 00m02s Complete! Finish: installing minimal buildroot with dnf5 Start: creating root cache Finish: creating root cache Finish: chroot init INFO: Installed packages: INFO: add-determinism-0.6.0-1.fc42.x86_64 alternatives-1.31-3.fc42.x86_64 ansible-srpm-macros-1-17.1.fc42.noarch audit-libs-4.0.3-2.fc42.x86_64 basesystem-11-22.fc42.noarch bash-5.2.37-1.fc42.x86_64 binutils-2.44-3.fc42.x86_64 build-reproducibility-srpm-macros-0.6.0-1.fc42.noarch bzip2-1.0.8-20.fc42.x86_64 bzip2-libs-1.0.8-20.fc42.x86_64 ca-certificates-2024.2.69_v8.0.401-5.fc42.noarch coreutils-9.6-1.fc42.x86_64 coreutils-common-9.6-1.fc42.x86_64 cpio-2.15-2.fc41.x86_64 crypto-policies-20250214-1.gitff7551b.fc42.noarch curl-8.11.1-4.fc42.x86_64 cyrus-sasl-lib-2.1.28-30.fc42.x86_64 debugedit-5.1-4.fc42.x86_64 diffutils-3.10-9.fc42.x86_64 dwz-0.15-9.fc42.x86_64 ed-1.21-2.fc42.x86_64 efi-srpm-macros-6-2.fc42.noarch elfutils-0.192-8.fc42.x86_64 elfutils-debuginfod-client-0.192-8.fc42.x86_64 elfutils-default-yama-scope-0.192-8.fc42.noarch elfutils-libelf-0.192-8.fc42.x86_64 elfutils-libs-0.192-8.fc42.x86_64 fedora-gpg-keys-42-0.5.noarch fedora-release-42-0.21.noarch fedora-release-common-42-0.21.noarch fedora-release-identity-basic-42-0.21.noarch fedora-repos-42-0.5.noarch file-5.46-1.fc42.x86_64 file-libs-5.46-1.fc42.x86_64 filesystem-3.18-36.fc42.x86_64 filesystem-srpm-macros-3.18-36.fc42.noarch findutils-4.10.0-5.fc42.x86_64 fonts-srpm-macros-2.0.5-21.fc42.noarch forge-srpm-macros-0.4.0-2.fc42.noarch fpc-srpm-macros-1.3-14.fc42.noarch gawk-5.3.1-1.fc42.x86_64 gdb-minimal-16.2-2.fc42.x86_64 gdbm-libs-1.23-9.fc42.x86_64 ghc-srpm-macros-1.9.2-2.fc42.noarch glibc-2.40.9000-35.fc42.x86_64 glibc-common-2.40.9000-35.fc42.x86_64 glibc-gconv-extra-2.40.9000-35.fc42.x86_64 glibc-minimal-langpack-2.40.9000-35.fc42.x86_64 gmp-6.3.0-2.fc41.x86_64 gnat-srpm-macros-6-7.fc42.noarch go-srpm-macros-3.6.0-6.fc42.noarch gpg-pubkey-105ef944-65ca83d1 grep-3.11-10.fc42.x86_64 gzip-1.13-3.fc42.x86_64 info-7.2-3.fc42.x86_64 jansson-2.14-2.fc42.x86_64 json-c-0.18-2.fc42.x86_64 kernel-srpm-macros-1.0-25.fc42.noarch keyutils-libs-1.6.3-5.fc42.x86_64 krb5-libs-1.21.3-5.fc42.x86_64 libacl-2.3.2-3.fc42.x86_64 libarchive-3.7.7-2.fc42.x86_64 libattr-2.5.2-5.fc42.x86_64 libblkid-2.40.4-7.fc42.x86_64 libbrotli-1.1.0-6.fc42.x86_64 libcap-2.73-2.fc42.x86_64 libcap-ng-0.8.5-4.fc42.x86_64 libcom_err-1.47.2-3.fc42.x86_64 libcurl-8.11.1-4.fc42.x86_64 libeconf-0.7.6-1.fc42.x86_64 libevent-2.1.12-15.fc42.x86_64 libfdisk-2.40.4-7.fc42.x86_64 libffi-3.4.6-5.fc42.x86_64 libgcc-15.0.1-0.9.fc42.x86_64 libgomp-15.0.1-0.9.fc42.x86_64 libidn2-2.3.7-3.fc42.x86_64 libmount-2.40.4-7.fc42.x86_64 libnghttp2-1.64.0-3.fc42.x86_64 libpkgconf-2.3.0-2.fc42.x86_64 libpsl-0.21.5-5.fc42.x86_64 libselinux-3.8-1.fc42.x86_64 libsemanage-3.8-1.fc42.x86_64 libsepol-3.8-1.fc42.x86_64 libsmartcols-2.40.4-7.fc42.x86_64 libssh-0.11.1-4.fc42.x86_64 libssh-config-0.11.1-4.fc42.noarch libstdc++-15.0.1-0.9.fc42.x86_64 libtasn1-4.20.0-1.fc42.x86_64 libtool-ltdl-2.5.4-4.fc42.x86_64 libunistring-1.1-9.fc42.x86_64 libuuid-2.40.4-7.fc42.x86_64 libverto-0.3.2-10.fc42.x86_64 libxcrypt-4.4.38-6.fc42.x86_64 libxml2-2.12.9-2.fc42.x86_64 libzstd-1.5.6-3.fc42.x86_64 lua-libs-5.4.7-2.fc42.x86_64 lua-srpm-macros-1-15.fc42.noarch lz4-libs-1.10.0-2.fc42.x86_64 mpfr-4.2.1-6.fc42.x86_64 ncurses-base-6.5-5.20250125.fc42.noarch ncurses-libs-6.5-5.20250125.fc42.x86_64 ocaml-srpm-macros-10-4.fc42.noarch openblas-srpm-macros-2-19.fc42.noarch openldap-2.6.9-3.fc42.x86_64 openssl-libs-3.2.4-1.fc42.x86_64 p11-kit-0.25.5-5.fc42.x86_64 p11-kit-trust-0.25.5-5.fc42.x86_64 package-notes-srpm-macros-0.5-13.fc42.noarch pam-libs-1.7.0-4.fc42.x86_64 patch-2.7.6-26.fc42.x86_64 pcre2-10.44-1.fc42.2.x86_64 pcre2-syntax-10.44-1.fc42.2.noarch perl-srpm-macros-1-57.fc42.noarch pkgconf-2.3.0-2.fc42.x86_64 pkgconf-m4-2.3.0-2.fc42.noarch pkgconf-pkg-config-2.3.0-2.fc42.x86_64 popt-1.19-8.fc42.x86_64 publicsuffix-list-dafsa-20250116-1.fc42.noarch pyproject-srpm-macros-1.17.0-1.fc42.noarch python-srpm-macros-3.13-4.fc42.noarch qt5-srpm-macros-5.15.15-1.fc42.noarch qt6-srpm-macros-6.8.2-2.fc42.noarch readline-8.2-12.fc42.x86_64 redhat-rpm-config-342-2.fc42.noarch rpm-4.20.0-8.fc42.x86_64 rpm-build-4.20.0-8.fc42.x86_64 rpm-build-libs-4.20.0-8.fc42.x86_64 rpm-libs-4.20.0-8.fc42.x86_64 rpm-sequoia-1.7.0-5.fc42.x86_64 rust-srpm-macros-26.3-4.fc42.noarch sed-4.9-4.fc42.x86_64 setup-2.15.0-12.fc42.noarch shadow-utils-4.17.0-4.fc42.x86_64 sqlite-libs-3.47.2-2.fc42.x86_64 systemd-libs-257.3-7.fc42.x86_64 systemd-standalone-sysusers-257.3-7.fc42.x86_64 tar-1.35-5.fc42.x86_64 tree-sitter-srpm-macros-0.1.0-8.fc42.noarch unzip-6.0-66.fc42.x86_64 util-linux-2.40.4-7.fc42.x86_64 util-linux-core-2.40.4-7.fc42.x86_64 which-2.23-1.fc42.x86_64 xxhash-libs-0.8.3-2.fc42.x86_64 xz-5.6.3-3.fc42.x86_64 xz-libs-5.6.3-3.fc42.x86_64 zig-srpm-macros-1-4.fc42.noarch zip-3.0-43.fc42.x86_64 zlib-ng-compat-2.2.3-2.fc42.x86_64 zstd-1.5.6-3.fc42.x86_64 Start: buildsrpm Start: rpmbuild -bs Building target platforms: x86_64 Building for target x86_64 setting SOURCE_DATE_EPOCH=1737158400 Wrote: /builddir/build/SRPMS/rccl-6.3.0-3.fc42.src.rpm Finish: rpmbuild -bs INFO: chroot_scan: 1 files copied to /var/lib/copr-rpmbuild/results/chroot_scan INFO: /var/lib/mock/fedora-42-x86_64-1741782565.967003/root/var/log/dnf5.log INFO: chroot_scan: creating tarball /var/lib/copr-rpmbuild/results/chroot_scan.tar.gz /bin/tar: Removing leading `/' from member names Finish: buildsrpm INFO: Done(/var/lib/copr-rpmbuild/workspace/workdir-26vvpt8i/rccl/rccl.spec) Config(child) 0 minutes 16 seconds INFO: Results and/or logs in: /var/lib/copr-rpmbuild/results INFO: Cleaning up build root ('cleanup_on_success=True') Start: clean chroot INFO: unmounting tmpfs. Finish: clean chroot INFO: Start(/var/lib/copr-rpmbuild/results/rccl-6.3.0-3.fc42.src.rpm) Config(fedora-42-x86_64) Start(bootstrap): chroot init INFO: mounting tmpfs at /var/lib/mock/fedora-42-x86_64-bootstrap-1741782565.967003/root. INFO: reusing tmpfs at /var/lib/mock/fedora-42-x86_64-bootstrap-1741782565.967003/root. INFO: calling preinit hooks INFO: enabled root cache INFO: enabled package manager cache Start(bootstrap): cleaning package manager metadata Finish(bootstrap): cleaning package manager metadata Finish(bootstrap): chroot init Start: chroot init INFO: mounting tmpfs at /var/lib/mock/fedora-42-x86_64-1741782565.967003/root. INFO: calling preinit hooks INFO: enabled root cache Start: unpacking root cache Finish: unpacking root cache INFO: enabled package manager cache Start: cleaning package manager metadata Finish: cleaning package manager metadata INFO: enabled HW Info plugin INFO: Buildroot is handled by package management downloaded with a bootstrap image: rpm-4.20.0-8.fc42.x86_64 rpm-sequoia-1.7.0-5.fc42.x86_64 dnf5-5.2.10.0-2.fc42.x86_64 dnf5-plugins-5.2.10.0-2.fc42.x86_64 Finish: chroot init Start: build phase for rccl-6.3.0-3.fc42.src.rpm Start: build setup for rccl-6.3.0-3.fc42.src.rpm Building target platforms: x86_64 Building for target x86_64 setting SOURCE_DATE_EPOCH=1737158400 Wrote: /builddir/build/SRPMS/rccl-6.3.0-3.fc42.src.rpm Updating and loading repositories: fedora 100% | 963.6 KiB/s | 27.0 KiB | 00m00s updates 100% | 198.0 KiB/s | 30.3 KiB | 00m00s Copr repository 100% | 165.3 KiB/s | 2.1 KiB | 00m00s Copr repository 100% | 5.9 MiB/s | 120.4 KiB | 00m00s Repositories loaded. Package Arch Version Repository Size Installing: cmake x86_64 3.31.5-1.fc42 fedora 34.2 MiB hipify x86_64 6.3.0-3.fc42 copr_base 2.9 MiB rocm-cmake noarch 6.3.0-2.fc42 copr_base 129.8 KiB rocm-comgr-devel x86_64 18-37.rocm6.3.1.fc42 copr_base 103.0 KiB rocm-core-devel x86_64 6.3.1-2.fc42 copr_base 14.8 KiB rocm-hip-devel x86_64 6.3.1-3.fc42 copr_base 2.7 MiB rocm-rpm-macros noarch 6.3.1-5.fc42 fedora 18.7 KiB rocm-runtime-devel x86_64 6.3.1-4.fc42 copr_base 565.6 KiB rocm-smi-devel x86_64 6.3.1-3.fc42 copr_base 237.6 KiB Installing dependencies: annobin-docs noarch 12.88-1.fc42 fedora 98.6 KiB annobin-plugin-gcc x86_64 12.88-1.fc42 fedora 991.7 KiB clang19-libs x86_64 19.1.7-12.fc42 fedora 124.2 MiB clang19-resource-filesystem x86_64 19.1.7-12.fc42 fedora 14.8 KiB cmake-data noarch 3.31.5-1.fc42 fedora 8.5 MiB cmake-filesystem x86_64 3.31.5-1.fc42 fedora 0.0 B cmake-rpm-macros noarch 3.31.5-1.fc42 fedora 7.7 KiB cpp x86_64 15.0.1-0.9.fc42 fedora 37.6 MiB emacs-filesystem noarch 1:30.0-4.fc42 fedora 0.0 B environment-modules x86_64 5.5.0-3.fc42 fedora 1.8 MiB expat x86_64 2.6.4-2.fc42 fedora 292.8 KiB gcc x86_64 15.0.1-0.9.fc42 fedora 110.2 MiB gcc-c++ x86_64 15.0.1-0.9.fc42 fedora 40.8 MiB gcc-plugin-annobin x86_64 15.0.1-0.9.fc42 fedora 57.2 KiB git x86_64 2.48.1-3.fc42 fedora 85.3 KiB git-core x86_64 2.48.1-3.fc42 fedora 22.7 MiB git-core-doc noarch 2.48.1-3.fc42 fedora 17.4 MiB glibc-devel x86_64 2.40.9000-35.fc42 fedora 2.3 MiB gnupg2 x86_64 2.4.7-2.fc42 fedora 9.8 MiB gnutls x86_64 3.8.9-2.fc42 fedora 3.6 MiB groff-base x86_64 1.23.0-8.fc42 fedora 3.9 MiB hipcc x86_64 18-37.rocm6.3.1.fc42 copr_base 761.6 KiB hwdata noarch 0.392-1.fc42 fedora 9.4 MiB jsoncpp x86_64 1.9.5-9.fc42 fedora 265.5 KiB kernel-headers x86_64 6.14.0-0.rc3.29.fc42 fedora 6.5 MiB less x86_64 668-2.fc42 fedora 405.8 KiB libassuan x86_64 2.5.7-3.fc42 fedora 167.8 KiB libb2 x86_64 0.98.1-13.fc42 fedora 46.1 KiB libcbor x86_64 0.11.0-3.fc42 fedora 77.8 KiB libdb x86_64 5.3.28-64.fc42 fedora 1.9 MiB libdrm x86_64 2.4.124-2.fc42 fedora 407.9 KiB libedit x86_64 3.1-55.20250104cvs.fc42 fedora 244.1 KiB libfido2 x86_64 1.15.0-3.fc42 fedora 242.1 KiB libgcrypt x86_64 1.11.0-5.fc42 fedora 1.6 MiB libgpg-error x86_64 1.51-2.fc42 fedora 894.1 KiB libksba x86_64 1.6.7-3.fc42 fedora 402.5 KiB libmpc x86_64 1.3.1-7.fc42 fedora 164.5 KiB libpciaccess x86_64 0.16-15.fc42 fedora 44.5 KiB libpipeline x86_64 1.5.8-2.fc42 fedora 145.1 KiB libstdc++-devel x86_64 15.0.1-0.9.fc42 fedora 15.9 MiB libtommath x86_64 1.3.1~rc1-5.fc42 fedora 130.4 KiB libusb1 x86_64 1.0.27-9.fc42 fedora 166.5 KiB libuv x86_64 1:1.50.0-1.fc42 fedora 566.8 KiB libxcrypt-devel x86_64 4.4.38-6.fc42 fedora 30.8 KiB llvm19-filesystem x86_64 19.1.7-12.fc42 fedora 0.0 B llvm19-libs x86_64 19.1.7-12.fc42 fedora 124.0 MiB make x86_64 1:4.4.1-10.fc42 fedora 1.8 MiB man-db x86_64 2.13.0-2.fc42 fedora 2.8 MiB mpdecimal x86_64 4.0.0-2.fc42 fedora 216.8 KiB ncurses x86_64 6.5-5.20250125.fc42 fedora 608.1 KiB nettle x86_64 3.10.1-1.fc42 fedora 790.5 KiB npth x86_64 1.8-2.fc42 fedora 49.6 KiB numactl-libs x86_64 2.0.19-2.fc42 fedora 52.9 KiB openssh x86_64 9.9p1-7.fc42 fedora 1.4 MiB openssh-clients x86_64 9.9p1-7.fc42 fedora 2.7 MiB perl x86_64 4:5.40.1-515.fc42 fedora 0.0 B perl-Algorithm-Diff noarch 1.2010-13.fc42 fedora 107.5 KiB perl-Archive-Tar noarch 3.02-513.fc42 fedora 154.0 KiB perl-Archive-Zip noarch 1.68-16.fc42 fedora 291.1 KiB perl-Attribute-Handlers noarch 1.03-515.fc42 fedora 39.9 KiB perl-AutoLoader noarch 5.74-515.fc42 fedora 20.5 KiB perl-AutoSplit noarch 5.74-515.fc42 fedora 23.1 KiB perl-B x86_64 1.89-515.fc42 fedora 498.0 KiB perl-Benchmark noarch 1.25-515.fc42 fedora 36.3 KiB perl-CPAN noarch 2.38-3.fc42 fedora 1.9 MiB perl-CPAN-Meta noarch 2.150010-512.fc42 fedora 592.2 KiB perl-CPAN-Meta-Requirements noarch 2.143-10.fc42 fedora 81.2 KiB perl-CPAN-Meta-YAML noarch 0.020-2.fc42 fedora 52.1 KiB perl-Carp noarch 1.54-512.fc42 fedora 46.6 KiB perl-Class-Struct noarch 0.68-515.fc42 fedora 25.4 KiB perl-Compress-Bzip2 x86_64 2.28-21.fc42 fedora 142.6 KiB perl-Compress-Raw-Bzip2 x86_64 2.213-2.fc42 fedora 67.3 KiB perl-Compress-Raw-Lzma x86_64 2.213-5.fc42 fedora 120.9 KiB perl-Compress-Raw-Zlib x86_64 2.213-2.fc42 fedora 163.2 KiB perl-Config-Extensions noarch 0.03-515.fc42 fedora 2.6 KiB perl-Config-Perl-V noarch 0.38-2.fc42 fedora 25.9 KiB perl-DBM_Filter noarch 0.06-515.fc42 fedora 28.5 KiB perl-DB_File x86_64 1.859-513.fc42 fedora 188.8 KiB perl-Data-Dumper x86_64 2.189-513.fc42 fedora 115.6 KiB perl-Data-OptList noarch 0.114-6.fc42 fedora 50.1 KiB perl-Data-Section noarch 0.200008-7.fc42 fedora 42.7 KiB perl-Devel-PPPort x86_64 3.72-513.fc42 fedora 892.1 KiB perl-Devel-Peek x86_64 1.34-515.fc42 fedora 43.5 KiB perl-Devel-SelfStubber noarch 1.06-515.fc42 fedora 6.7 KiB perl-Devel-Size x86_64 0.84-4.fc42 fedora 41.7 KiB perl-Digest noarch 1.20-512.fc42 fedora 35.3 KiB perl-Digest-MD5 x86_64 2.59-6.fc42 fedora 59.7 KiB perl-Digest-SHA x86_64 1:6.04-513.fc42 fedora 112.5 KiB perl-DirHandle noarch 1.05-515.fc42 fedora 3.4 KiB perl-Dumpvalue noarch 2.27-515.fc42 fedora 19.8 KiB perl-DynaLoader x86_64 1.56-515.fc42 fedora 32.1 KiB perl-Encode x86_64 4:3.21-512.fc42 fedora 4.7 MiB perl-Encode-devel x86_64 4:3.21-512.fc42 fedora 99.6 KiB perl-English noarch 1.11-515.fc42 fedora 6.2 KiB perl-Env noarch 1.06-512.fc42 fedora 26.1 KiB perl-Errno x86_64 1.38-515.fc42 fedora 8.3 KiB perl-Error noarch 1:0.17030-1.fc42 fedora 76.7 KiB perl-Exporter noarch 5.78-512.fc42 fedora 54.3 KiB perl-ExtUtils-CBuilder noarch 1:0.280240-512.fc42 fedora 96.9 KiB perl-ExtUtils-Command noarch 2:7.70-513.fc42 fedora 9.6 KiB perl-ExtUtils-Constant noarch 0.25-515.fc42 fedora 85.8 KiB perl-ExtUtils-Embed noarch 1.35-515.fc42 fedora 15.5 KiB perl-ExtUtils-Install noarch 2.22-512.fc42 fedora 85.5 KiB perl-ExtUtils-MM-Utils noarch 2:7.70-513.fc42 fedora 2.9 KiB perl-ExtUtils-MakeMaker noarch 2:7.70-513.fc42 fedora 734.1 KiB perl-ExtUtils-Manifest noarch 1:1.75-512.fc42 fedora 84.8 KiB perl-ExtUtils-Miniperl noarch 1.14-515.fc42 fedora 8.2 KiB perl-ExtUtils-ParseXS noarch 1:3.51-512.fc42 fedora 399.6 KiB perl-Fcntl x86_64 1.18-515.fc42 fedora 48.9 KiB perl-File-Basename noarch 2.86-515.fc42 fedora 14.0 KiB perl-File-Compare noarch 1.100.800-515.fc42 fedora 5.6 KiB perl-File-Copy noarch 2.41-515.fc42 fedora 19.6 KiB perl-File-DosGlob x86_64 1.12-515.fc42 fedora 20.8 KiB perl-File-Fetch noarch 1.04-512.fc42 fedora 59.2 KiB perl-File-Find noarch 1.44-515.fc42 fedora 41.9 KiB perl-File-HomeDir noarch 1.006-14.fc42 fedora 119.3 KiB perl-File-Path noarch 2.18-512.fc42 fedora 63.5 KiB perl-File-Temp noarch 1:0.231.100-512.fc42 fedora 162.3 KiB perl-File-Which noarch 1.27-13.fc42 fedora 30.4 KiB perl-File-stat noarch 1.14-515.fc42 fedora 12.5 KiB perl-FileCache noarch 1.10-515.fc42 fedora 7.4 KiB perl-FileHandle noarch 2.05-515.fc42 fedora 9.3 KiB perl-Filter x86_64 2:1.64-513.fc42 fedora 156.7 KiB perl-Filter-Simple noarch 0.96-512.fc42 fedora 50.7 KiB perl-FindBin noarch 1.54-515.fc42 fedora 6.7 KiB perl-GDBM_File x86_64 1:1.24-515.fc42 fedora 79.6 KiB perl-Getopt-Long noarch 1:2.58-3.fc42 fedora 144.5 KiB perl-Getopt-Std noarch 1.14-515.fc42 fedora 11.2 KiB perl-Git noarch 2.48.1-3.fc42 fedora 64.0 KiB perl-HTTP-Tiny noarch 0.090-2.fc42 fedora 154.4 KiB perl-Hash-Util x86_64 0.32-515.fc42 fedora 55.0 KiB perl-Hash-Util-FieldHash x86_64 1.27-515.fc42 fedora 62.5 KiB perl-I18N-Collate noarch 1.02-515.fc42 fedora 7.1 KiB perl-I18N-LangTags noarch 0.45-515.fc42 fedora 82.3 KiB perl-I18N-Langinfo x86_64 0.24-515.fc42 fedora 34.7 KiB perl-IO x86_64 1.55-515.fc42 fedora 147.0 KiB perl-IO-Compress noarch 2.213-3.fc42 fedora 1.0 MiB perl-IO-Compress-Lzma noarch 2.213-2.fc42 fedora 215.2 KiB perl-IO-Socket-IP noarch 0.43-2.fc42 fedora 100.3 KiB perl-IO-Socket-SSL noarch 2.089-2.fc42 fedora 703.3 KiB perl-IO-Zlib noarch 1:1.15-512.fc42 fedora 25.7 KiB perl-IPC-Cmd noarch 2:1.04-513.fc42 fedora 84.9 KiB perl-IPC-Open3 noarch 1.22-515.fc42 fedora 22.5 KiB perl-IPC-SysV x86_64 2.09-513.fc42 fedora 73.7 KiB perl-IPC-System-Simple noarch 1.30-15.fc42 fedora 71.7 KiB perl-JSON-PP noarch 1:4.16-513.fc42 fedora 141.8 KiB perl-Locale-Maketext noarch 1.33-513.fc42 fedora 171.3 KiB perl-Locale-Maketext-Simple noarch 1:0.21-515.fc42 fedora 12.8 KiB perl-MIME-Base32 noarch 1.303-23.fc42 fedora 30.7 KiB perl-MIME-Base64 x86_64 3.16-512.fc42 fedora 42.0 KiB perl-MRO-Compat noarch 0.15-11.fc42 fedora 43.0 KiB perl-Math-BigInt noarch 1:2.0030.04-1.fc42 fedora 962.3 KiB perl-Math-BigInt-FastCalc x86_64 0.501.800-512.fc42 fedora 43.9 KiB perl-Math-Complex noarch 1.62-515.fc42 fedora 85.0 KiB perl-Memoize noarch 1.16-515.fc42 fedora 64.5 KiB perl-Module-Build noarch 2:0.42.34-8.fc42 fedora 654.2 KiB perl-Module-CoreList noarch 1:5.20250120-1.fc42 fedora 1.2 MiB perl-Module-CoreList-tools noarch 1:5.20250120-1.fc42 fedora 18.6 KiB perl-Module-Load noarch 1:0.36-512.fc42 fedora 14.9 KiB perl-Module-Load-Conditional noarch 0.74-512.fc42 fedora 28.7 KiB perl-Module-Loaded noarch 1:0.08-515.fc42 fedora 5.0 KiB perl-Module-Metadata noarch 1.000038-512.fc42 fedora 67.5 KiB perl-Module-Signature noarch 0.89-2.fc42 fedora 139.4 KiB perl-NDBM_File x86_64 1.17-515.fc42 fedora 28.4 KiB perl-NEXT noarch 0.69-515.fc42 fedora 23.5 KiB perl-Net noarch 1.04-515.fc42 fedora 22.3 KiB perl-Net-Ping noarch 2.76-512.fc42 fedora 134.2 KiB perl-Net-SSLeay x86_64 1.94-8.fc42 fedora 1.3 MiB perl-ODBM_File x86_64 1.18-515.fc42 fedora 28.3 KiB perl-Opcode x86_64 1.65-515.fc42 fedora 48.4 KiB perl-POSIX x86_64 2.20-515.fc42 fedora 231.0 KiB perl-Package-Generator noarch 1.106-33.fc42 fedora 29.9 KiB perl-Params-Check noarch 1:0.38-512.fc42 fedora 27.6 KiB perl-Params-Util x86_64 1.102-17.fc42 fedora 58.5 KiB perl-PathTools x86_64 3.91-513.fc42 fedora 180.0 KiB perl-Perl-OSType noarch 1.010-513.fc42 fedora 32.8 KiB perl-PerlIO-via-QuotedPrint noarch 0.10-512.fc42 fedora 30.2 KiB perl-Pod-Checker noarch 4:1.77-512.fc42 fedora 52.2 KiB perl-Pod-Escapes noarch 1:1.07-512.fc42 fedora 24.9 KiB perl-Pod-Functions noarch 1.14-515.fc42 fedora 14.2 KiB perl-Pod-Html noarch 1.35-515.fc42 fedora 42.2 KiB perl-Pod-Perldoc noarch 3.28.01-513.fc42 fedora 163.7 KiB perl-Pod-Simple noarch 1:3.45-512.fc42 fedora 560.8 KiB perl-Pod-Usage noarch 4:2.03-512.fc42 fedora 84.8 KiB perl-Safe noarch 2.46-515.fc42 fedora 30.6 KiB perl-Scalar-List-Utils x86_64 5:1.68-2.fc42 fedora 144.8 KiB perl-Search-Dict noarch 1.07-515.fc42 fedora 4.7 KiB perl-SelectSaver noarch 1.02-515.fc42 fedora 2.2 KiB perl-SelfLoader noarch 1.27-515.fc42 fedora 22.4 KiB perl-Socket x86_64 4:2.038-512.fc42 fedora 119.9 KiB perl-Software-License noarch 0.104006-3.fc42 fedora 501.9 KiB perl-Storable x86_64 1:3.32-512.fc42 fedora 232.3 KiB perl-Sub-Exporter noarch 0.991-5.fc42 fedora 194.9 KiB perl-Sub-Install noarch 0.929-7.fc42 fedora 35.9 KiB perl-Symbol noarch 1.09-515.fc42 fedora 6.8 KiB perl-Sys-Hostname x86_64 1.25-515.fc42 fedora 15.8 KiB perl-Sys-Syslog x86_64 0.36-513.fc42 fedora 94.7 KiB perl-Term-ANSIColor noarch 5.01-513.fc42 fedora 97.5 KiB perl-Term-Cap noarch 1.18-512.fc42 fedora 29.3 KiB perl-Term-Complete noarch 1.403-515.fc42 fedora 5.7 KiB perl-Term-ReadLine noarch 1.17-515.fc42 fedora 17.3 KiB perl-Term-Table noarch 0.024-2.fc42 fedora 77.9 KiB perl-TermReadKey x86_64 2.38-24.fc42 fedora 64.0 KiB perl-Test noarch 1.31-515.fc42 fedora 37.0 KiB perl-Test-Harness noarch 1:3.50-2.fc42 fedora 559.6 KiB perl-Test-Simple noarch 3:1.302209-1.fc42 fedora 1.7 MiB perl-Text-Abbrev noarch 1.02-515.fc42 fedora 3.1 KiB perl-Text-Balanced noarch 2.06-512.fc42 fedora 111.4 KiB perl-Text-Diff noarch 1.45-23.fc42 fedora 83.0 KiB perl-Text-Glob noarch 0.11-25.fc42 fedora 8.4 KiB perl-Text-ParseWords noarch 3.31-512.fc42 fedora 13.6 KiB perl-Text-Tabs+Wrap noarch 2024.001-512.fc42 fedora 22.6 KiB perl-Text-Template noarch 1.61-7.fc42 fedora 112.4 KiB perl-Thread noarch 3.05-515.fc42 fedora 12.1 KiB perl-Thread-Queue noarch 3.14-512.fc42 fedora 28.9 KiB perl-Thread-Semaphore noarch 2.13-515.fc42 fedora 10.0 KiB perl-Tie noarch 4.6-515.fc42 fedora 32.0 KiB perl-Tie-File noarch 1.09-515.fc42 fedora 85.7 KiB perl-Tie-Memoize noarch 1.1-515.fc42 fedora 6.2 KiB perl-Tie-RefHash noarch 1.41-2.fc42 fedora 35.9 KiB perl-Time noarch 1.04-515.fc42 fedora 9.7 KiB perl-Time-HiRes x86_64 4:1.9777-512.fc42 fedora 115.8 KiB perl-Time-Local noarch 2:1.350-512.fc42 fedora 68.9 KiB perl-Time-Piece x86_64 1.3401-515.fc42 fedora 71.0 KiB perl-URI noarch 5.31-2.fc42 fedora 257.0 KiB perl-Unicode-Collate x86_64 1.31-512.fc42 fedora 4.2 MiB perl-Unicode-Normalize x86_64 1.32-512.fc42 fedora 465.1 KiB perl-Unicode-UCD noarch 0.78-515.fc42 fedora 204.4 KiB perl-User-pwent noarch 1.05-515.fc42 fedora 17.0 KiB perl-autodie noarch 2.37-513.fc42 fedora 214.9 KiB perl-autouse noarch 1.11-515.fc42 fedora 5.9 KiB perl-base noarch 2.27-515.fc42 fedora 12.5 KiB perl-bignum noarch 0.67-513.fc42 fedora 133.1 KiB perl-blib noarch 1.07-515.fc42 fedora 3.2 KiB perl-constant noarch 1.33-513.fc42 fedora 26.2 KiB perl-debugger noarch 1.60-515.fc42 fedora 402.2 KiB perl-deprecate noarch 0.04-515.fc42 fedora 6.5 KiB perl-devel x86_64 4:5.40.1-515.fc42 fedora 8.0 MiB perl-diagnostics noarch 1.40-515.fc42 fedora 465.4 KiB perl-doc noarch 5.40.1-515.fc42 fedora 11.0 MiB perl-encoding x86_64 4:3.00-512.fc42 fedora 149.5 KiB perl-encoding-warnings noarch 0.14-515.fc42 fedora 10.1 KiB perl-experimental noarch 0.034-2.fc42 fedora 42.4 KiB perl-fields noarch 2.27-515.fc42 fedora 11.8 KiB perl-filetest noarch 1.03-515.fc42 fedora 6.4 KiB perl-if noarch 0.61.000-515.fc42 fedora 5.8 KiB perl-inc-latest noarch 2:0.500-30.fc42 fedora 34.6 KiB perl-interpreter x86_64 4:5.40.1-515.fc42 fedora 118.1 KiB perl-less noarch 0.03-515.fc42 fedora 4.9 KiB perl-lib x86_64 0.65-515.fc42 fedora 8.5 KiB perl-libnet noarch 3.15-513.fc42 fedora 289.4 KiB perl-libnetcfg noarch 4:5.40.1-515.fc42 fedora 16.9 KiB perl-libs x86_64 4:5.40.1-515.fc42 fedora 9.8 MiB perl-local-lib noarch 2.000029-9.fc42 fedora 117.6 KiB perl-locale noarch 1.12-515.fc42 fedora 6.5 KiB perl-macros noarch 4:5.40.1-515.fc42 fedora 5.5 KiB perl-meta-notation noarch 5.40.1-515.fc42 fedora 2.0 KiB perl-mro x86_64 1.29-515.fc42 fedora 41.5 KiB perl-open noarch 1.13-515.fc42 fedora 11.3 KiB perl-overload noarch 1.37-515.fc42 fedora 71.5 KiB perl-overloading noarch 0.02-515.fc42 fedora 4.8 KiB perl-parent noarch 1:0.244-2.fc42 fedora 10.3 KiB perl-perlfaq noarch 5.20240218-512.fc42 fedora 732.6 KiB perl-ph x86_64 5.40.1-515.fc42 fedora 270.8 KiB perl-podlators noarch 1:6.0.2-3.fc42 fedora 317.5 KiB perl-sigtrap noarch 1.10-515.fc42 fedora 11.0 KiB perl-sort noarch 2.05-515.fc42 fedora 4.8 KiB perl-subs noarch 1.04-515.fc42 fedora 2.1 KiB perl-threads x86_64 1:2.40-512.fc42 fedora 115.0 KiB perl-threads-shared x86_64 1.69-512.fc42 fedora 83.6 KiB perl-utils noarch 5.40.1-515.fc42 fedora 96.8 KiB perl-vars noarch 1.05-515.fc42 fedora 3.9 KiB perl-version x86_64 9:0.99.33-2.fc42 fedora 128.7 KiB perl-vmsish noarch 1.04-515.fc42 fedora 6.5 KiB procps-ng x86_64 4.0.4-6.fc42 fedora 1.0 MiB python-pip-wheel noarch 24.3.1-2.fc42 fedora 1.2 MiB python3 x86_64 3.13.2-2.fc42 fedora 27.6 KiB python3-libs x86_64 3.13.2-2.fc42 fedora 39.9 MiB python3-pyparsing noarch 3.1.2-8.fc42 fedora 996.4 KiB rhash x86_64 1.4.5-2.fc42 fedora 351.0 KiB rocm-clang x86_64 18-37.rocm6.3.1.fc42 copr_base 117.6 MiB rocm-clang-devel x86_64 18-37.rocm6.3.1.fc42 copr_base 21.8 MiB rocm-clang-libs x86_64 18-37.rocm6.3.1.fc42 copr_base 113.9 MiB rocm-clang-runtime-devel x86_64 18-37.rocm6.3.1.fc42 copr_base 6.9 MiB rocm-comgr x86_64 18-37.rocm6.3.1.fc42 copr_base 137.1 MiB rocm-core x86_64 6.3.1-2.fc42 copr_base 12.3 KiB rocm-device-libs x86_64 18-37.rocm6.3.1.fc42 copr_base 3.2 MiB rocm-hip x86_64 6.3.1-3.fc42 copr_base 23.3 MiB rocm-libc++ x86_64 18-37.rocm6.3.1.fc42 copr_base 1.5 MiB rocm-libc++-devel x86_64 18-37.rocm6.3.1.fc42 copr_base 7.0 MiB rocm-lld x86_64 18-37.rocm6.3.1.fc42 copr_base 6.5 MiB rocm-llvm x86_64 18-37.rocm6.3.1.fc42 copr_base 79.3 MiB rocm-llvm-devel x86_64 18-37.rocm6.3.1.fc42 copr_base 24.4 MiB rocm-llvm-filesystem x86_64 18-37.rocm6.3.1.fc42 copr_base 0.0 B rocm-llvm-libs x86_64 18-37.rocm6.3.1.fc42 copr_base 93.8 MiB rocm-llvm-static x86_64 18-37.rocm6.3.1.fc42 copr_base 233.9 MiB rocm-runtime x86_64 6.3.1-4.fc42 copr_base 2.9 MiB rocm-smi x86_64 6.3.1-3.fc42 copr_base 2.5 MiB systemtap-sdt-devel x86_64 5.3~pre17373816g7a71d34b-5.fc42 fedora 182.6 KiB systemtap-sdt-dtrace x86_64 5.3~pre17373816g7a71d34b-5.fc42 fedora 179.1 KiB tcl x86_64 1:9.0.0-7.fc42 fedora 4.3 MiB tpm2-tss x86_64 4.1.3-6.fc42 fedora 1.6 MiB tzdata noarch 2025a-1.fc42 fedora 1.6 MiB vim-filesystem noarch 2:9.1.1081-1.fc42 fedora 40.0 B Transaction Summary: Installing: 313 packages Total size of inbound packages is 362 MiB. Need to download 362 MiB. After this operation, 2 GiB extra will be used (install 2 GiB, remove 0 B). [ 1/313] rocm-cmake-0:6.3.0-2.fc42.noa 100% | 3.4 MiB/s | 38.0 KiB | 00m00s [ 2/313] rocm-comgr-devel-0:18-37.rocm 100% | 31.8 MiB/s | 32.6 KiB | 00m00s [ 3/313] rocm-core-devel-0:6.3.1-2.fc4 100% | 4.5 MiB/s | 13.8 KiB | 00m00s [ 4/313] rocm-hip-devel-0:6.3.1-3.fc42 100% | 73.8 MiB/s | 226.6 KiB | 00m00s [ 5/313] rocm-rpm-macros-0:6.3.1-5.fc4 100% | 7.4 MiB/s | 15.1 KiB | 00m00s [ 6/313] rocm-runtime-devel-0:6.3.1-4. 100% | 45.2 MiB/s | 92.6 KiB | 00m00s [ 7/313] cmake-0:3.31.5-1.fc42.x86_64 100% | 264.8 MiB/s | 12.2 MiB | 00m00s [ 8/313] cmake-data-0:3.31.5-1.fc42.no 100% | 308.0 MiB/s | 2.5 MiB | 00m00s [ 9/313] cmake-filesystem-0:3.31.5-1.f 100% | 17.3 MiB/s | 17.7 KiB | 00m00s [ 10/313] expat-0:2.6.4-2.fc42.x86_64 100% | 112.0 MiB/s | 114.7 KiB | 00m00s [ 11/313] jsoncpp-0:1.9.5-9.fc42.x86_64 100% | 100.1 MiB/s | 102.5 KiB | 00m00s [ 12/313] libuv-1:1.50.0-1.fc42.x86_64 100% | 129.3 MiB/s | 264.8 KiB | 00m00s [ 13/313] rocm-smi-devel-0:6.3.1-3.fc42 100% | 1.2 MiB/s | 48.9 KiB | 00m00s [ 14/313] make-1:4.4.1-10.fc42.x86_64 100% | 191.1 MiB/s | 587.0 KiB | 00m00s [ 15/313] rhash-0:1.4.5-2.fc42.x86_64 100% | 97.0 MiB/s | 198.7 KiB | 00m00s [ 16/313] hipify-0:6.3.0-3.fc42.x86_64 100% | 3.8 MiB/s | 483.6 KiB | 00m00s [ 17/313] perl-4:5.40.1-515.fc42.x86_64 100% | 2.7 MiB/s | 13.7 KiB | 00m00s [ 18/313] perl-interpreter-4:5.40.1-515 100% | 8.8 MiB/s | 72.2 KiB | 00m00s [ 19/313] perl-File-Basename-0:2.86-515 100% | 4.2 MiB/s | 17.2 KiB | 00m00s [ 20/313] perl-File-Copy-0:2.41-515.fc4 100% | 6.6 MiB/s | 20.1 KiB | 00m00s [ 21/313] perl-File-Which-0:1.27-13.fc4 100% | 7.0 MiB/s | 21.6 KiB | 00m00s [ 22/313] perl-Getopt-Std-0:1.14-515.fc 100% | 3.8 MiB/s | 15.7 KiB | 00m00s [ 23/313] perl-PathTools-0:3.91-513.fc4 100% | 14.2 MiB/s | 87.3 KiB | 00m00s [ 24/313] perl-Scalar-List-Utils-5:1.68 100% | 24.3 MiB/s | 74.7 KiB | 00m00s [ 25/313] clang19-libs-0:19.1.7-12.fc42 100% | 214.6 MiB/s | 27.5 MiB | 00m00s [ 26/313] perl-URI-0:5.31-2.fc42.noarch 100% | 4.6 MiB/s | 140.7 KiB | 00m00s [ 27/313] rocm-runtime-0:6.3.1-4.fc42.x 100% | 149.6 MiB/s | 612.6 KiB | 00m00s [ 28/313] llvm19-libs-0:19.1.7-12.fc42. 100% | 195.9 MiB/s | 31.4 MiB | 00m00s [ 29/313] environment-modules-0:5.5.0-3 100% | 21.3 MiB/s | 764.7 KiB | 00m00s [ 30/313] emacs-filesystem-1:30.0-4.fc4 100% | 253.6 KiB/s | 7.4 KiB | 00m00s [ 31/313] vim-filesystem-2:9.1.1081-1.f 100% | 16.0 MiB/s | 16.4 KiB | 00m00s [ 32/313] clang19-resource-filesystem-0 100% | 19.4 MiB/s | 19.9 KiB | 00m00s [ 33/313] llvm19-filesystem-0:19.1.7-12 100% | 7.0 MiB/s | 14.3 KiB | 00m00s [ 34/313] libedit-0:3.1-55.20250104cvs. 100% | 51.4 MiB/s | 105.3 KiB | 00m00s [ 35/313] perl-Attribute-Handlers-0:1.0 100% | 13.7 MiB/s | 28.1 KiB | 00m00s [ 36/313] perl-AutoLoader-0:5.74-515.fc 100% | 10.4 MiB/s | 21.2 KiB | 00m00s [ 37/313] perl-Archive-Tar-0:3.02-513.f 100% | 23.1 MiB/s | 70.9 KiB | 00m00s [ 38/313] perl-AutoSplit-0:5.74-515.fc4 100% | 21.1 MiB/s | 21.6 KiB | 00m00s [ 39/313] perl-Benchmark-0:1.25-515.fc4 100% | 13.1 MiB/s | 26.8 KiB | 00m00s [ 40/313] perl-B-0:1.89-515.fc42.x86_64 100% | 57.6 MiB/s | 177.0 KiB | 00m00s [ 41/313] perl-CPAN-0:2.38-3.fc42.noarc 100% | 138.5 MiB/s | 567.2 KiB | 00m00s [ 42/313] perl-CPAN-Meta-Requirements-0 100% | 17.2 MiB/s | 35.2 KiB | 00m00s [ 43/313] perl-CPAN-Meta-0:2.150010-512 100% | 37.3 MiB/s | 190.8 KiB | 00m00s [ 44/313] perl-CPAN-Meta-YAML-0:0.020-2 100% | 8.7 MiB/s | 26.8 KiB | 00m00s [ 45/313] perl-Carp-0:1.54-512.fc42.noa 100% | 9.4 MiB/s | 28.9 KiB | 00m00s [ 46/313] perl-Class-Struct-0:0.68-515. 100% | 10.8 MiB/s | 22.1 KiB | 00m00s [ 47/313] perl-Compress-Raw-Bzip2-0:2.2 100% | 11.8 MiB/s | 36.3 KiB | 00m00s [ 48/313] perl-Config-Extensions-0:0.03 100% | 6.0 MiB/s | 12.3 KiB | 00m00s [ 49/313] perl-Compress-Raw-Zlib-0:2.21 100% | 21.3 MiB/s | 65.5 KiB | 00m00s [ 50/313] perl-Config-Perl-V-0:0.38-2.f 100% | 10.6 MiB/s | 21.8 KiB | 00m00s [ 51/313] perl-DBM_Filter-0:0.06-515.fc 100% | 13.2 MiB/s | 27.1 KiB | 00m00s [ 52/313] perl-DB_File-0:1.859-513.fc42 100% | 39.5 MiB/s | 81.0 KiB | 00m00s [ 53/313] perl-Data-Dumper-0:2.189-513. 100% | 55.3 MiB/s | 56.7 KiB | 00m00s [ 54/313] perl-Devel-Peek-0:1.34-515.fc 100% | 15.6 MiB/s | 31.9 KiB | 00m00s [ 55/313] perl-Devel-PPPort-0:3.72-513. 100% | 71.9 MiB/s | 220.8 KiB | 00m00s [ 56/313] perl-Devel-SelfStubber-0:1.06 100% | 7.0 MiB/s | 14.3 KiB | 00m00s [ 57/313] perl-Digest-0:1.20-512.fc42.n 100% | 12.2 MiB/s | 24.9 KiB | 00m00s [ 58/313] perl-Digest-SHA-1:6.04-513.fc 100% | 60.7 MiB/s | 62.2 KiB | 00m00s [ 59/313] perl-Digest-MD5-0:2.59-6.fc42 100% | 11.7 MiB/s | 36.0 KiB | 00m00s [ 60/313] perl-DirHandle-0:1.05-515.fc4 100% | 6.1 MiB/s | 12.5 KiB | 00m00s [ 61/313] perl-Dumpvalue-0:2.27-515.fc4 100% | 6.0 MiB/s | 18.3 KiB | 00m00s [ 62/313] perl-English-0:1.11-515.fc42. 100% | 13.3 MiB/s | 13.6 KiB | 00m00s [ 63/313] perl-DynaLoader-0:1.56-515.fc 100% | 8.5 MiB/s | 26.0 KiB | 00m00s [ 64/313] perl-Env-0:1.06-512.fc42.noar 100% | 19.2 MiB/s | 19.7 KiB | 00m00s [ 65/313] perl-Errno-0:1.38-515.fc42.x8 100% | 14.6 MiB/s | 14.9 KiB | 00m00s [ 66/313] perl-ExtUtils-CBuilder-1:0.28 100% | 24.7 MiB/s | 50.6 KiB | 00m00s [ 67/313] perl-Exporter-0:5.78-512.fc42 100% | 15.1 MiB/s | 31.0 KiB | 00m00s [ 68/313] perl-ExtUtils-Command-2:7.70- 100% | 6.8 MiB/s | 14.0 KiB | 00m00s [ 69/313] perl-ExtUtils-Constant-0:0.25 100% | 21.3 MiB/s | 43.7 KiB | 00m00s [ 70/313] perl-ExtUtils-Embed-0:1.35-51 100% | 8.6 MiB/s | 17.7 KiB | 00m00s [ 71/313] perl-ExtUtils-Install-0:2.22- 100% | 14.1 MiB/s | 43.5 KiB | 00m00s [ 72/313] perl-ExtUtils-MakeMaker-2:7.7 100% | 95.3 MiB/s | 292.9 KiB | 00m00s [ 73/313] perl-ExtUtils-Manifest-1:1.75 100% | 16.7 MiB/s | 34.1 KiB | 00m00s [ 74/313] perl-ExtUtils-MM-Utils-2:7.70 100% | 2.8 MiB/s | 11.5 KiB | 00m00s [ 75/313] perl-ExtUtils-Miniperl-0:1.14 100% | 14.7 MiB/s | 15.0 KiB | 00m00s [ 76/313] perl-Fcntl-0:1.18-515.fc42.x8 100% | 14.6 MiB/s | 29.8 KiB | 00m00s [ 77/313] perl-ExtUtils-ParseXS-1:3.51- 100% | 60.9 MiB/s | 187.2 KiB | 00m00s [ 78/313] perl-File-DosGlob-0:1.12-515. 100% | 9.6 MiB/s | 19.6 KiB | 00m00s [ 79/313] perl-File-Compare-0:1.100.800 100% | 4.3 MiB/s | 13.3 KiB | 00m00s [ 80/313] perl-File-Fetch-0:1.04-512.fc 100% | 9.9 MiB/s | 30.5 KiB | 00m00s [ 81/313] perl-File-Find-0:1.44-515.fc4 100% | 12.4 MiB/s | 25.4 KiB | 00m00s [ 82/313] perl-File-Path-0:2.18-512.fc4 100% | 17.2 MiB/s | 35.2 KiB | 00m00s [ 83/313] perl-FileCache-0:1.10-515.fc4 100% | 14.4 MiB/s | 14.7 KiB | 00m00s [ 84/313] perl-File-Temp-1:0.231.100-51 100% | 19.3 MiB/s | 59.2 KiB | 00m00s [ 85/313] perl-File-stat-0:1.14-515.fc4 100% | 8.3 MiB/s | 17.1 KiB | 00m00s [ 86/313] perl-Filter-Simple-0:0.96-512 100% | 26.4 MiB/s | 27.0 KiB | 00m00s [ 87/313] perl-FileHandle-0:2.05-515.fc 100% | 7.6 MiB/s | 15.5 KiB | 00m00s [ 88/313] perl-Filter-2:1.64-513.fc42.x 100% | 21.0 MiB/s | 86.0 KiB | 00m00s [ 89/313] perl-FindBin-0:1.54-515.fc42. 100% | 6.9 MiB/s | 14.2 KiB | 00m00s [ 90/313] perl-GDBM_File-1:1.24-515.fc4 100% | 20.8 MiB/s | 42.6 KiB | 00m00s [ 91/313] perl-Hash-Util-0:0.32-515.fc4 100% | 33.7 MiB/s | 34.5 KiB | 00m00s [ 92/313] perl-Getopt-Long-1:2.58-3.fc4 100% | 31.1 MiB/s | 63.7 KiB | 00m00s [ 93/313] perl-HTTP-Tiny-0:0.090-2.fc42 100% | 18.4 MiB/s | 56.5 KiB | 00m00s [ 94/313] perl-Hash-Util-FieldHash-0:1. 100% | 38.0 MiB/s | 38.9 KiB | 00m00s [ 95/313] perl-I18N-Collate-0:1.02-515. 100% | 13.9 MiB/s | 14.2 KiB | 00m00s [ 96/313] perl-I18N-Langinfo-0:0.24-515 100% | 25.0 MiB/s | 25.6 KiB | 00m00s [ 97/313] perl-IO-0:1.55-515.fc42.x86_6 100% | 26.6 MiB/s | 81.7 KiB | 00m00s [ 98/313] perl-I18N-LangTags-0:0.45-515 100% | 17.1 MiB/s | 52.5 KiB | 00m00s [ 99/313] perl-IO-Compress-0:2.213-3.fc 100% | 99.5 MiB/s | 305.7 KiB | 00m00s [100/313] perl-IO-Socket-IP-0:0.43-2.fc 100% | 20.7 MiB/s | 42.4 KiB | 00m00s [101/313] perl-IO-Zlib-1:1.15-512.fc42. 100% | 6.4 MiB/s | 19.7 KiB | 00m00s [102/313] perl-IPC-SysV-0:2.09-513.fc42 100% | 39.9 MiB/s | 40.8 KiB | 00m00s [103/313] perl-IPC-Cmd-2:1.04-513.fc42. 100% | 12.9 MiB/s | 39.7 KiB | 00m00s [104/313] perl-IPC-Open3-0:1.22-515.fc4 100% | 5.3 MiB/s | 21.8 KiB | 00m00s [105/313] perl-JSON-PP-1:4.16-513.fc42. 100% | 32.0 MiB/s | 65.5 KiB | 00m00s [106/313] perl-Locale-Maketext-0:1.33-5 100% | 22.9 MiB/s | 93.7 KiB | 00m00s [107/313] perl-MIME-Base64-0:3.16-512.f 100% | 9.7 MiB/s | 29.9 KiB | 00m00s [108/313] perl-Locale-Maketext-Simple-1 100% | 5.7 MiB/s | 17.6 KiB | 00m00s [109/313] perl-Math-BigInt-FastCalc-0:0 100% | 27.5 MiB/s | 28.2 KiB | 00m00s [110/313] perl-Math-BigInt-1:2.0030.04- 100% | 73.8 MiB/s | 226.8 KiB | 00m00s [111/313] perl-Math-Complex-0:1.62-515. 100% | 15.0 MiB/s | 46.1 KiB | 00m00s [112/313] perl-Memoize-0:1.16-515.fc42. 100% | 22.6 MiB/s | 46.3 KiB | 00m00s [113/313] perl-Module-CoreList-tools-1: 100% | 6.1 MiB/s | 18.6 KiB | 00m00s [114/313] perl-Module-Load-1:0.36-512.f 100% | 8.4 MiB/s | 17.3 KiB | 00m00s [115/313] perl-Module-CoreList-1:5.2025 100% | 17.9 MiB/s | 91.5 KiB | 00m00s [116/313] perl-Module-Loaded-1:0.08-515 100% | 13.1 MiB/s | 13.4 KiB | 00m00s [117/313] perl-Module-Load-Conditional- 100% | 4.3 MiB/s | 22.0 KiB | 00m00s [118/313] perl-NDBM_File-0:1.17-515.fc4 100% | 11.1 MiB/s | 22.7 KiB | 00m00s [119/313] perl-Module-Metadata-0:1.0000 100% | 6.9 MiB/s | 35.4 KiB | 00m00s [120/313] perl-NEXT-0:0.69-515.fc42.noa 100% | 20.4 MiB/s | 20.9 KiB | 00m00s [121/313] perl-Net-0:1.04-515.fc42.noar 100% | 22.0 MiB/s | 22.6 KiB | 00m00s [122/313] perl-Net-Ping-0:2.76-512.fc42 100% | 24.2 MiB/s | 49.6 KiB | 00m00s [123/313] perl-ODBM_File-0:1.18-515.fc4 100% | 11.1 MiB/s | 22.7 KiB | 00m00s [124/313] perl-Opcode-0:1.65-515.fc42.x 100% | 17.5 MiB/s | 35.9 KiB | 00m00s [125/313] perl-Params-Check-1:0.38-512. 100% | 10.7 MiB/s | 21.8 KiB | 00m00s [126/313] perl-POSIX-0:2.20-515.fc42.x8 100% | 31.8 MiB/s | 97.7 KiB | 00m00s [127/313] perl-PerlIO-via-QuotedPrint-0 100% | 21.2 MiB/s | 21.7 KiB | 00m00s [128/313] perl-Pod-Checker-4:1.77-512.f 100% | 31.0 MiB/s | 31.8 KiB | 00m00s [129/313] perl-Perl-OSType-0:1.010-513. 100% | 5.6 MiB/s | 22.8 KiB | 00m00s [130/313] perl-Pod-Functions-0:1.14-515 100% | 14.3 MiB/s | 14.7 KiB | 00m00s [131/313] perl-Pod-Html-0:1.35-515.fc42 100% | 28.7 MiB/s | 29.4 KiB | 00m00s [132/313] perl-Pod-Escapes-1:1.07-512.f 100% | 9.7 MiB/s | 19.8 KiB | 00m00s [133/313] perl-Pod-Usage-4:2.03-512.fc4 100% | 39.1 MiB/s | 40.0 KiB | 00m00s [134/313] perl-Pod-Perldoc-0:3.28.01-51 100% | 27.9 MiB/s | 85.8 KiB | 00m00s [135/313] perl-Pod-Simple-1:3.45-512.fc 100% | 71.3 MiB/s | 219.0 KiB | 00m00s [136/313] perl-Safe-0:2.46-515.fc42.noa 100% | 12.2 MiB/s | 24.9 KiB | 00m00s [137/313] perl-Search-Dict-0:1.07-515.f 100% | 12.7 MiB/s | 13.0 KiB | 00m00s [138/313] perl-SelectSaver-0:1.02-515.f 100% | 5.7 MiB/s | 11.7 KiB | 00m00s [139/313] perl-SelfLoader-0:1.27-515.fc 100% | 10.5 MiB/s | 21.6 KiB | 00m00s [140/313] perl-Socket-4:2.038-512.fc42. 100% | 17.8 MiB/s | 54.8 KiB | 00m00s [141/313] perl-Storable-1:3.32-512.fc42 100% | 32.4 MiB/s | 99.6 KiB | 00m00s [142/313] perl-Symbol-0:1.09-515.fc42.n 100% | 6.9 MiB/s | 14.2 KiB | 00m00s [143/313] perl-Sys-Hostname-0:1.25-515. 100% | 16.7 MiB/s | 17.1 KiB | 00m00s [144/313] perl-Sys-Syslog-0:0.36-513.fc 100% | 22.8 MiB/s | 46.6 KiB | 00m00s [145/313] perl-Term-Cap-0:1.18-512.fc42 100% | 21.6 MiB/s | 22.2 KiB | 00m00s [146/313] perl-Term-ANSIColor-0:5.01-51 100% | 23.3 MiB/s | 47.7 KiB | 00m00s [147/313] perl-Term-ReadLine-0:1.17-515 100% | 18.6 MiB/s | 19.1 KiB | 00m00s [148/313] perl-Term-Table-0:0.024-2.fc4 100% | 21.0 MiB/s | 43.1 KiB | 00m00s [149/313] perl-Term-Complete-0:1.403-51 100% | 4.2 MiB/s | 13.0 KiB | 00m00s [150/313] perl-Test-0:1.31-515.fc42.noa 100% | 13.9 MiB/s | 28.6 KiB | 00m00s [151/313] perl-Test-Harness-1:3.50-2.fc 100% | 90.2 MiB/s | 277.1 KiB | 00m00s [152/313] perl-Text-Abbrev-0:1.02-515.f 100% | 5.9 MiB/s | 12.2 KiB | 00m00s [153/313] perl-Text-Balanced-0:2.06-512 100% | 47.7 MiB/s | 48.8 KiB | 00m00s [154/313] perl-Test-Simple-3:1.302209-1 100% | 105.1 MiB/s | 860.7 KiB | 00m00s [155/313] perl-Text-ParseWords-0:3.31-5 100% | 4.0 MiB/s | 16.5 KiB | 00m00s [156/313] perl-Text-Tabs+Wrap-0:2024.00 100% | 7.1 MiB/s | 21.8 KiB | 00m00s [157/313] perl-Thread-0:3.05-515.fc42.n 100% | 8.8 MiB/s | 18.0 KiB | 00m00s [158/313] perl-Thread-Semaphore-0:2.13- 100% | 7.6 MiB/s | 15.6 KiB | 00m00s [159/313] perl-Thread-Queue-0:3.14-512. 100% | 7.0 MiB/s | 21.4 KiB | 00m00s [160/313] perl-Tie-0:4.6-515.fc42.noarc 100% | 13.5 MiB/s | 27.7 KiB | 00m00s [161/313] perl-Tie-File-0:1.09-515.fc42 100% | 21.1 MiB/s | 43.3 KiB | 00m00s [162/313] perl-Tie-Memoize-0:1.1-515.fc 100% | 6.9 MiB/s | 14.1 KiB | 00m00s [163/313] perl-Tie-RefHash-0:1.41-2.fc4 100% | 23.0 MiB/s | 23.6 KiB | 00m00s [164/313] perl-Time-0:1.04-515.fc42.noa 100% | 16.4 MiB/s | 16.7 KiB | 00m00s [165/313] perl-Time-HiRes-4:1.9777-512. 100% | 28.1 MiB/s | 57.5 KiB | 00m00s [166/313] perl-Time-Piece-0:1.3401-515. 100% | 19.6 MiB/s | 40.2 KiB | 00m00s [167/313] perl-Time-Local-2:1.350-512.f 100% | 16.8 MiB/s | 34.5 KiB | 00m00s [168/313] perl-Unicode-Collate-0:1.31-5 100% | 210.1 MiB/s | 645.6 KiB | 00m00s [169/313] perl-Unicode-UCD-0:0.78-515.f 100% | 25.5 MiB/s | 78.3 KiB | 00m00s [170/313] perl-Unicode-Normalize-0:1.32 100% | 18.1 MiB/s | 74.1 KiB | 00m00s [171/313] perl-autodie-0:2.37-513.fc42. 100% | 47.3 MiB/s | 96.9 KiB | 00m00s [172/313] perl-autouse-0:1.11-515.fc42. 100% | 13.5 MiB/s | 13.8 KiB | 00m00s [173/313] perl-User-pwent-0:1.05-515.fc 100% | 6.4 MiB/s | 19.5 KiB | 00m00s [174/313] perl-bignum-0:0.67-513.fc42.n 100% | 47.8 MiB/s | 49.0 KiB | 00m00s [175/313] perl-base-0:2.27-515.fc42.noa 100% | 15.8 MiB/s | 16.2 KiB | 00m00s [176/313] perl-blib-0:1.07-515.fc42.noa 100% | 12.1 MiB/s | 12.4 KiB | 00m00s [177/313] perl-constant-0:1.33-513.fc42 100% | 22.4 MiB/s | 23.0 KiB | 00m00s [178/313] perl-debugger-0:1.60-515.fc42 100% | 65.0 MiB/s | 133.1 KiB | 00m00s [179/313] perl-diagnostics-0:1.40-515.f 100% | 106.3 MiB/s | 217.6 KiB | 00m00s [180/313] perl-deprecate-0:0.04-515.fc4 100% | 3.6 MiB/s | 14.6 KiB | 00m00s [181/313] perl-devel-4:5.40.1-515.fc42. 100% | 186.6 MiB/s | 764.3 KiB | 00m00s [182/313] perl-encoding-4:3.00-512.fc42 100% | 61.5 MiB/s | 63.0 KiB | 00m00s [183/313] perl-encoding-warnings-0:0.14 100% | 16.2 MiB/s | 16.6 KiB | 00m00s [184/313] perl-experimental-0:0.034-2.f 100% | 26.4 MiB/s | 27.0 KiB | 00m00s [185/313] perl-fields-0:2.27-515.fc42.n 100% | 15.8 MiB/s | 16.1 KiB | 00m00s [186/313] perl-filetest-0:1.03-515.fc42 100% | 14.3 MiB/s | 14.6 KiB | 00m00s [187/313] perl-if-0:0.61.000-515.fc42.n 100% | 13.7 MiB/s | 14.0 KiB | 00m00s [188/313] perl-less-0:0.03-515.fc42.noa 100% | 12.9 MiB/s | 13.2 KiB | 00m00s [189/313] perl-lib-0:0.65-515.fc42.x86_ 100% | 14.6 MiB/s | 15.0 KiB | 00m00s [190/313] perl-libnet-0:3.15-513.fc42.n 100% | 125.4 MiB/s | 128.4 KiB | 00m00s [191/313] perl-libnetcfg-4:5.40.1-515.f 100% | 15.9 MiB/s | 16.3 KiB | 00m00s [192/313] perl-locale-0:1.12-515.fc42.n 100% | 13.3 MiB/s | 13.6 KiB | 00m00s [193/313] perl-macros-4:5.40.1-515.fc42 100% | 4.0 MiB/s | 12.3 KiB | 00m00s [194/313] perl-libs-4:5.40.1-515.fc42.x 100% | 259.6 MiB/s | 2.3 MiB | 00m00s [195/313] perl-meta-notation-0:5.40.1-5 100% | 3.5 MiB/s | 10.7 KiB | 00m00s [196/313] perl-open-0:1.13-515.fc42.noa 100% | 8.1 MiB/s | 16.5 KiB | 00m00s [197/313] perl-mro-0:1.29-515.fc42.x86_ 100% | 14.6 MiB/s | 29.9 KiB | 00m00s [198/313] perl-overloading-0:0.02-515.f 100% | 12.6 MiB/s | 12.9 KiB | 00m00s [199/313] perl-overload-0:1.37-515.fc42 100% | 22.2 MiB/s | 45.5 KiB | 00m00s [200/313] perl-parent-1:0.244-2.fc42.no 100% | 14.9 MiB/s | 15.2 KiB | 00m00s [201/313] perl-ph-0:5.40.1-515.fc42.x86 100% | 47.6 MiB/s | 48.8 KiB | 00m00s [202/313] perl-perlfaq-0:5.20240218-512 100% | 92.4 MiB/s | 378.4 KiB | 00m00s [203/313] perl-podlators-1:6.0.2-3.fc42 100% | 62.8 MiB/s | 128.6 KiB | 00m00s [204/313] perl-sigtrap-0:1.10-515.fc42. 100% | 15.3 MiB/s | 15.7 KiB | 00m00s [205/313] perl-sort-0:2.05-515.fc42.noa 100% | 12.9 MiB/s | 13.2 KiB | 00m00s [206/313] perl-subs-0:1.04-515.fc42.noa 100% | 11.4 MiB/s | 11.7 KiB | 00m00s [207/313] perl-threads-1:2.40-512.fc42. 100% | 56.7 MiB/s | 58.0 KiB | 00m00s [208/313] perl-utils-0:5.40.1-515.fc42. 100% | 51.1 MiB/s | 52.3 KiB | 00m00s [209/313] perl-threads-shared-0:1.69-51 100% | 21.7 MiB/s | 44.5 KiB | 00m00s [210/313] perl-vars-0:1.05-515.fc42.noa 100% | 12.7 MiB/s | 13.0 KiB | 00m00s [211/313] perl-version-9:0.99.33-2.fc42 100% | 61.5 MiB/s | 63.0 KiB | 00m00s [212/313] perl-vmsish-0:1.04-515.fc42.n 100% | 13.8 MiB/s | 14.1 KiB | 00m00s [213/313] perl-MIME-Base32-0:1.303-23.f 100% | 20.0 MiB/s | 20.5 KiB | 00m00s [214/313] less-0:668-2.fc42.x86_64 100% | 61.8 MiB/s | 190.0 KiB | 00m00s [215/313] man-db-0:2.13.0-2.fc42.x86_64 100% | 262.6 MiB/s | 1.3 MiB | 00m00s [216/313] libdrm-0:2.4.124-2.fc42.x86_6 100% | 52.4 MiB/s | 161.0 KiB | 00m00s [217/313] numactl-libs-0:2.0.19-2.fc42. 100% | 15.3 MiB/s | 31.3 KiB | 00m00s [218/313] perl-IO-Compress-Lzma-0:2.213 100% | 74.9 MiB/s | 76.7 KiB | 00m00s [219/313] perl-Text-Diff-0:1.45-23.fc42 100% | 39.2 MiB/s | 40.1 KiB | 00m00s [220/313] perl-Archive-Zip-0:1.68-16.fc 100% | 108.9 MiB/s | 111.5 KiB | 00m00s [221/313] perl-Devel-Size-0:0.84-4.fc42 100% | 15.0 MiB/s | 30.6 KiB | 00m00s [222/313] perl-Compress-Bzip2-0:2.28-21 100% | 32.8 MiB/s | 67.1 KiB | 00m00s [223/313] perl-File-HomeDir-0:1.006-14. 100% | 57.9 MiB/s | 59.3 KiB | 00m00s [224/313] perl-Module-Build-2:0.42.34-8 100% | 122.8 MiB/s | 251.5 KiB | 00m00s [225/313] perl-Module-Signature-0:0.89- 100% | 42.1 MiB/s | 86.2 KiB | 00m00s [226/313] perl-Text-Glob-0:0.11-25.fc42 100% | 13.1 MiB/s | 13.4 KiB | 00m00s [227/313] perl-local-lib-0:2.000029-9.f 100% | 32.4 MiB/s | 66.3 KiB | 00m00s [228/313] libdb-0:5.3.28-64.fc42.x86_64 100% | 252.6 MiB/s | 775.9 KiB | 00m00s [229/313] perl-IO-Socket-SSL-0:2.089-2. 100% | 112.4 MiB/s | 230.2 KiB | 00m00s [230/313] perl-Net-SSLeay-0:1.94-8.fc42 100% | 183.6 MiB/s | 376.0 KiB | 00m00s [231/313] ncurses-0:6.5-5.20250125.fc42 100% | 138.2 MiB/s | 424.5 KiB | 00m00s [232/313] groff-base-0:1.23.0-8.fc42.x8 100% | 184.1 MiB/s | 1.1 MiB | 00m00s [233/313] perl-IPC-System-Simple-0:1.30 100% | 37.9 MiB/s | 38.8 KiB | 00m00s [234/313] libxcrypt-devel-0:4.4.38-6.fc 100% | 14.3 MiB/s | 29.3 KiB | 00m00s [235/313] systemtap-sdt-dtrace-0:5.3~pr 100% | 33.8 MiB/s | 69.2 KiB | 00m00s [236/313] libpipeline-0:1.5.8-2.fc42.x8 100% | 29.3 MiB/s | 60.0 KiB | 00m00s [237/313] libpciaccess-0:0.16-15.fc42.x 100% | 12.8 MiB/s | 26.3 KiB | 00m00s [238/313] perl-Compress-Raw-Lzma-0:2.21 100% | 25.4 MiB/s | 52.0 KiB | 00m00s [239/313] perl-Algorithm-Diff-0:1.2010- 100% | 22.6 MiB/s | 46.4 KiB | 00m00s [240/313] perl-inc-latest-2:0.500-30.fc 100% | 11.4 MiB/s | 23.3 KiB | 00m00s [241/313] perl-Software-License-0:0.104 100% | 73.3 MiB/s | 150.1 KiB | 00m00s [242/313] glibc-devel-0:2.40.9000-35.fc 100% | 319.0 MiB/s | 653.3 KiB | 00m00s [243/313] python3-pyparsing-0:3.1.2-8.f 100% | 54.3 MiB/s | 278.0 KiB | 00m00s [244/313] gnupg2-0:2.4.7-2.fc42.x86_64 100% | 213.6 MiB/s | 2.8 MiB | 00m00s [245/313] hwdata-0:0.392-1.fc42.noarch 100% | 233.4 MiB/s | 1.6 MiB | 00m00s [246/313] perl-Data-Section-0:0.200008- 100% | 8.1 MiB/s | 24.9 KiB | 00m00s [247/313] perl-Text-Template-0:1.61-7.f 100% | 57.7 MiB/s | 59.1 KiB | 00m00s [248/313] libassuan-0:2.5.7-3.fc42.x86_ 100% | 33.0 MiB/s | 67.6 KiB | 00m00s [249/313] gnutls-0:3.8.9-2.fc42.x86_64 100% | 178.6 MiB/s | 1.2 MiB | 00m00s [250/313] perl-doc-0:5.40.1-515.fc42.no 100% | 46.5 MiB/s | 4.9 MiB | 00m00s [251/313] libgcrypt-0:1.11.0-5.fc42.x86 100% | 72.4 MiB/s | 593.3 KiB | 00m00s [252/313] libgpg-error-0:1.51-2.fc42.x8 100% | 38.6 MiB/s | 237.2 KiB | 00m00s [253/313] libksba-0:1.6.7-3.fc42.x86_64 100% | 52.7 MiB/s | 162.0 KiB | 00m00s [254/313] npth-0:1.8-2.fc42.x86_64 100% | 12.6 MiB/s | 25.8 KiB | 00m00s [255/313] perl-MRO-Compat-0:0.15-11.fc4 100% | 24.8 MiB/s | 25.4 KiB | 00m00s [256/313] tpm2-tss-0:4.1.3-6.fc42.x86_6 100% | 103.9 MiB/s | 425.4 KiB | 00m00s [257/313] perl-Sub-Exporter-0:0.991-5.f 100% | 37.9 MiB/s | 77.6 KiB | 00m00s [258/313] nettle-0:3.10.1-1.fc42.x86_64 100% | 138.2 MiB/s | 424.4 KiB | 00m00s [259/313] perl-Data-OptList-0:0.114-6.f 100% | 13.1 MiB/s | 26.8 KiB | 00m00s [260/313] libusb1-0:1.0.27-9.fc42.x86_6 100% | 37.8 MiB/s | 77.5 KiB | 00m00s [261/313] perl-Package-Generator-0:1.10 100% | 21.9 MiB/s | 22.4 KiB | 00m00s [262/313] perl-Params-Util-0:1.102-17.f 100% | 31.9 MiB/s | 32.7 KiB | 00m00s [263/313] perl-Sub-Install-0:0.929-7.fc 100% | 22.1 MiB/s | 22.6 KiB | 00m00s [264/313] python3-0:3.13.2-2.fc42.x86_6 100% | 27.7 MiB/s | 28.4 KiB | 00m00s [265/313] libb2-0:0.98.1-13.fc42.x86_64 100% | 4.1 MiB/s | 25.4 KiB | 00m00s [266/313] mpdecimal-0:4.0.0-2.fc42.x86_ 100% | 13.5 MiB/s | 97.0 KiB | 00m00s [267/313] python3-libs-0:3.13.2-2.fc42. 100% | 367.2 MiB/s | 9.2 MiB | 00m00s [268/313] python-pip-wheel-0:24.3.1-2.f 100% | 92.6 MiB/s | 1.2 MiB | 00m00s [269/313] tzdata-0:2025a-1.fc42.noarch 100% | 232.2 MiB/s | 713.3 KiB | 00m00s [270/313] hipcc-0:18-37.rocm6.3.1.fc42. 100% | 31.6 MiB/s | 129.4 KiB | 00m00s [271/313] rocm-hip-0:6.3.1-3.fc42.x86_6 100% | 217.0 MiB/s | 9.3 MiB | 00m00s [272/313] rocm-core-0:6.3.1-2.fc42.x86_ 100% | 1.0 MiB/s | 13.8 KiB | 00m00s [273/313] rocm-smi-0:6.3.1-3.fc42.x86_6 100% | 6.4 MiB/s | 574.6 KiB | 00m00s [274/313] rocm-device-libs-0:18-37.rocm 100% | 79.3 MiB/s | 487.3 KiB | 00m00s [275/313] perl-Encode-4:3.21-512.fc42.x 100% | 210.5 MiB/s | 1.1 MiB | 00m00s [276/313] systemtap-sdt-devel-0:5.3~pre 100% | 22.3 MiB/s | 68.6 KiB | 00m00s [277/313] rocm-comgr-0:18-37.rocm6.3.1. 100% | 283.6 MiB/s | 29.5 MiB | 00m00s [278/313] libmpc-0:1.3.1-7.fc42.x86_64 100% | 3.0 MiB/s | 70.9 KiB | 00m00s [279/313] cpp-0:15.0.1-0.9.fc42.x86_64 100% | 148.0 MiB/s | 12.7 MiB | 00m00s [280/313] perl-Encode-devel-4:3.21-512. 100% | 3.1 MiB/s | 41.1 KiB | 00m00s [281/313] kernel-headers-0:6.14.0-0.rc3 100% | 48.6 MiB/s | 1.7 MiB | 00m00s [282/313] gcc-0:15.0.1-0.9.fc42.x86_64 100% | 211.8 MiB/s | 38.8 MiB | 00m00s [283/313] gcc-c++-0:15.0.1-0.9.fc42.x86 100% | 103.5 MiB/s | 15.0 MiB | 00m00s [284/313] libstdc++-devel-0:15.0.1-0.9. 100% | 34.4 MiB/s | 2.8 MiB | 00m00s [285/313] procps-ng-0:4.0.4-6.fc42.x86_ 100% | 10.2 MiB/s | 365.3 KiB | 00m00s [286/313] libtommath-0:1.3.1~rc1-5.fc42 100% | 21.0 MiB/s | 64.4 KiB | 00m00s [287/313] tcl-1:9.0.0-7.fc42.x86_64 100% | 137.5 MiB/s | 1.2 MiB | 00m00s [288/313] rocm-clang-devel-0:18-37.rocm 100% | 193.3 MiB/s | 2.3 MiB | 00m00s [289/313] rocm-lld-0:18-37.rocm6.3.1.fc 100% | 111.1 MiB/s | 1.4 MiB | 00m00s [290/313] git-0:2.48.1-3.fc42.x86_64 100% | 25.2 MiB/s | 51.6 KiB | 00m00s [291/313] git-core-0:2.48.1-3.fc42.x86_ 100% | 286.8 MiB/s | 4.9 MiB | 00m00s [292/313] git-core-doc-0:2.48.1-3.fc42. 100% | 272.5 MiB/s | 3.0 MiB | 00m00s [293/313] perl-Git-0:2.48.1-3.fc42.noar 100% | 18.7 MiB/s | 38.3 KiB | 00m00s [294/313] perl-TermReadKey-0:2.38-24.fc 100% | 34.6 MiB/s | 35.4 KiB | 00m00s [295/313] openssh-clients-0:9.9p1-7.fc4 100% | 188.3 MiB/s | 771.4 KiB | 00m00s [296/313] perl-Error-1:0.17030-1.fc42.n 100% | 39.4 MiB/s | 40.4 KiB | 00m00s [297/313] libfido2-0:1.15.0-3.fc42.x86_ 100% | 48.0 MiB/s | 98.4 KiB | 00m00s [298/313] openssh-0:9.9p1-7.fc42.x86_64 100% | 115.7 MiB/s | 355.5 KiB | 00m00s [299/313] libcbor-0:0.11.0-3.fc42.x86_6 100% | 32.5 MiB/s | 33.3 KiB | 00m00s [300/313] rocm-clang-0:18-37.rocm6.3.1. 100% | 162.7 MiB/s | 21.6 MiB | 00m00s [301/313] rocm-llvm-static-0:18-37.rocm 100% | 154.6 MiB/s | 27.5 MiB | 00m00s [302/313] rocm-clang-runtime-devel-0:18 100% | 9.3 MiB/s | 486.4 KiB | 00m00s [303/313] rocm-clang-libs-0:18-37.rocm6 100% | 148.0 MiB/s | 22.3 MiB | 00m00s [304/313] rocm-libc++-devel-0:18-37.roc 100% | 22.0 MiB/s | 833.6 KiB | 00m00s [305/313] rocm-libc++-0:18-37.rocm6.3.1 100% | 43.3 MiB/s | 354.7 KiB | 00m00s [306/313] rocm-llvm-filesystem-0:18-37. 100% | 5.4 MiB/s | 22.0 KiB | 00m00s [307/313] rocm-llvm-devel-0:18-37.rocm6 100% | 82.3 MiB/s | 3.6 MiB | 00m00s [308/313] annobin-plugin-gcc-0:12.88-1. 100% | 239.7 MiB/s | 981.9 KiB | 00m00s [309/313] gcc-plugin-annobin-0:15.0.1-0 100% | 20.7 MiB/s | 42.4 KiB | 00m00s [310/313] annobin-docs-0:12.88-1.fc42.n 100% | 89.5 MiB/s | 91.7 KiB | 00m00s [311/313] cmake-rpm-macros-0:3.31.5-1.f 100% | 8.3 MiB/s | 17.0 KiB | 00m00s [312/313] rocm-llvm-0:18-37.rocm6.3.1.f 100% | 160.8 MiB/s | 16.1 MiB | 00m00s [313/313] rocm-llvm-libs-0:18-37.rocm6. 100% | 124.3 MiB/s | 19.6 MiB | 00m00s -------------------------------------------------------------------------------- [313/313] Total 100% | 295.3 MiB/s | 362.0 MiB | 00m01s Running transaction [ 1/315] Verify package files 100% | 266.0 B/s | 313.0 B | 00m01s [ 2/315] Prepare transaction 100% | 2.9 KiB/s | 313.0 B | 00m00s [ 3/315] Installing cmake-filesystem-0 100% | 7.4 MiB/s | 7.6 KiB | 00m00s [ 4/315] Installing libgpg-error-0:1.5 100% | 54.9 MiB/s | 900.0 KiB | 00m00s [ 5/315] Installing libmpc-0:1.3.1-7.f 100% | 162.2 MiB/s | 166.1 KiB | 00m00s [ 6/315] Installing less-0:668-2.fc42. 100% | 28.5 MiB/s | 409.1 KiB | 00m00s [ 7/315] Installing make-1:4.4.1-10.fc 100% | 105.9 MiB/s | 1.8 MiB | 00m00s [ 8/315] Installing expat-0:2.6.4-2.fc 100% | 22.2 MiB/s | 294.9 KiB | 00m00s [ 9/315] Installing rocm-llvm-filesyst 100% | 6.8 MiB/s | 13.9 KiB | 00m00s [ 10/315] Installing rocm-libc++-0:18-3 100% | 72.8 MiB/s | 1.5 MiB | 00m00s [ 11/315] Installing rocm-llvm-libs-0:1 100% | 79.8 MiB/s | 93.8 MiB | 00m01s [ 12/315] Installing rocm-clang-libs-0: 100% | 84.1 MiB/s | 113.9 MiB | 00m01s [ 13/315] Installing rocm-comgr-0:18-37 100% | 78.3 MiB/s | 137.1 MiB | 00m02s [ 14/315] Installing groff-base-0:1.23. 100% | 118.0 MiB/s | 3.9 MiB | 00m00s [ 15/315] Installing numactl-libs-0:2.0 100% | 52.5 MiB/s | 53.8 KiB | 00m00s [ 16/315] Installing libedit-0:3.1-55.2 100% | 240.0 MiB/s | 245.8 KiB | 00m00s [ 17/315] Installing vim-filesystem-2:9 100% | 4.6 MiB/s | 4.7 KiB | 00m00s [ 18/315] Installing rocm-lld-0:18-37.r 100% | 75.0 MiB/s | 6.5 MiB | 00m00s [ 19/315] Installing rocm-libc++-devel- 100% | 100.1 MiB/s | 7.2 MiB | 00m00s [ 20/315] Installing cpp-0:15.0.1-0.9.f 100% | 361.5 MiB/s | 37.6 MiB | 00m00s [ 21/315] Installing libassuan-0:2.5.7- 100% | 165.6 MiB/s | 169.6 KiB | 00m00s [ 22/315] Installing libgcrypt-0:1.11.0 100% | 392.3 MiB/s | 1.6 MiB | 00m00s [ 23/315] Installing libksba-0:1.6.7-3. 100% | 197.8 MiB/s | 405.1 KiB | 00m00s [ 24/315] Installing annobin-docs-0:12. 100% | 97.4 MiB/s | 99.8 KiB | 00m00s [ 25/315] Installing rocm-clang-runtime 100% | 140.9 MiB/s | 6.9 MiB | 00m00s [ 26/315] Installing libcbor-0:0.11.0-3 100% | 77.3 MiB/s | 79.2 KiB | 00m00s [ 27/315] Installing libfido2-0:1.15.0- 100% | 237.9 MiB/s | 243.6 KiB | 00m00s [ 28/315] Installing openssh-0:9.9p1-7. 100% | 86.3 MiB/s | 1.4 MiB | 00m00s [ 29/315] Installing openssh-clients-0: 100% | 117.6 MiB/s | 2.7 MiB | 00m00s [ 30/315] Installing git-core-0:2.48.1- 100% | 355.4 MiB/s | 22.7 MiB | 00m00s [ 31/315] Installing git-core-doc-0:2.4 100% | 382.7 MiB/s | 17.6 MiB | 00m00s [ 32/315] Installing libtommath-0:1.3.1 100% | 128.4 MiB/s | 131.5 KiB | 00m00s [ 33/315] Installing tcl-1:9.0.0-7.fc42 100% | 166.7 MiB/s | 4.3 MiB | 00m00s [ 34/315] Installing procps-ng-0:4.0.4- 100% | 56.1 MiB/s | 1.0 MiB | 00m00s [ 35/315] Installing libstdc++-devel-0: 100% | 390.5 MiB/s | 16.0 MiB | 00m00s [ 36/315] Installing kernel-headers-0:6 100% | 215.3 MiB/s | 6.7 MiB | 00m00s [ 37/315] Installing glibc-devel-0:2.40 100% | 179.5 MiB/s | 2.3 MiB | 00m00s [ 38/315] Installing libxcrypt-devel-0: 100% | 32.3 MiB/s | 33.1 KiB | 00m00s [ 39/315] Installing gcc-0:15.0.1-0.9.f 100% | 419.2 MiB/s | 110.2 MiB | 00m00s [ 40/315] Installing gcc-c++-0:15.0.1-0 100% | 361.4 MiB/s | 40.8 MiB | 00m00s [ 41/315] Installing systemtap-sdt-deve 100% | 179.7 MiB/s | 184.0 KiB | 00m00s [ 42/315] Installing rocm-core-0:6.3.1- 100% | 3.3 MiB/s | 13.5 KiB | 00m00s [ 43/315] Installing tzdata-0:2025a-1.f 100% | 62.8 MiB/s | 1.9 MiB | 00m00s [ 44/315] Installing python-pip-wheel-0 100% | 622.1 MiB/s | 1.2 MiB | 00m00s [ 45/315] Installing mpdecimal-0:4.0.0- 100% | 213.2 MiB/s | 218.4 KiB | 00m00s [ 46/315] Installing libb2-0:0.98.1-13. 100% | 9.2 MiB/s | 47.2 KiB | 00m00s [ 47/315] Installing python3-libs-0:3.1 100% | 356.5 MiB/s | 40.3 MiB | 00m00s [ 48/315] Installing python3-0:3.13.2-2 100% | 2.2 MiB/s | 29.4 KiB | 00m00s [ 49/315] Installing cmake-rpm-macros-0 100% | 0.0 B/s | 8.3 KiB | 00m00s [ 50/315] Installing python3-pyparsing- 100% | 327.1 MiB/s | 1.0 MiB | 00m00s [ 51/315] Installing systemtap-sdt-dtra 100% | 13.6 MiB/s | 180.4 KiB | 00m00s [ 52/315] Installing rocm-smi-0:6.3.1-3 100% | 138.8 MiB/s | 2.5 MiB | 00m00s [ 53/315] Installing rocm-llvm-0:18-37. 100% | 80.6 MiB/s | 79.3 MiB | 00m01s [ 54/315] Installing rocm-llvm-devel-0: 100% | 97.3 MiB/s | 24.7 MiB | 00m00s [ 55/315] Installing rocm-llvm-static-0 100% | 105.8 MiB/s | 233.9 MiB | 00m02s [ 56/315] Installing libusb1-0:1.0.27-9 100% | 12.6 MiB/s | 168.2 KiB | 00m00s >>> Running unknown scriptlet: tpm2-tss-0:4.1.3-6.fc42.x86_64 >>> Finished unknown scriptlet: tpm2-tss-0:4.1.3-6.fc42.x86_64 >>> Scriptlet output: >>> Creating group 'tss' with GID 59. >>> Creating user 'tss' (Account used for TPM access) with UID 59 and GID 59. >>> [ 57/315] Installing tpm2-tss-0:4.1.3-6 100% | 261.3 MiB/s | 1.6 MiB | 00m00s [ 58/315] Installing nettle-0:3.10.1-1. 100% | 258.3 MiB/s | 793.6 KiB | 00m00s [ 59/315] Installing gnutls-0:3.8.9-2.f 100% | 360.1 MiB/s | 3.6 MiB | 00m00s [ 60/315] Installing npth-0:1.8-2.fc42. 100% | 49.5 MiB/s | 50.7 KiB | 00m00s [ 61/315] Installing gnupg2-0:2.4.7-2.f 100% | 239.1 MiB/s | 9.8 MiB | 00m00s [ 62/315] Installing hwdata-0:0.392-1.f 100% | 553.5 MiB/s | 9.4 MiB | 00m00s [ 63/315] Installing libpciaccess-0:0.1 100% | 44.8 MiB/s | 45.9 KiB | 00m00s [ 64/315] Installing libdrm-0:2.4.124-2 100% | 201.1 MiB/s | 411.8 KiB | 00m00s [ 65/315] Installing rocm-runtime-0:6.3 100% | 482.5 MiB/s | 2.9 MiB | 00m00s [ 66/315] Installing rocm-runtime-devel 100% | 555.8 MiB/s | 569.2 KiB | 00m00s [ 67/315] Installing libpipeline-0:1.5. 100% | 14.3 MiB/s | 146.6 KiB | 00m00s [ 68/315] Installing man-db-0:2.13.0-2. 100% | 83.8 MiB/s | 2.8 MiB | 00m00s [ 69/315] Installing environment-module 100% | 64.4 MiB/s | 1.8 MiB | 00m00s [ 70/315] Installing ncurses-0:6.5-5.20 100% | 37.5 MiB/s | 614.7 KiB | 00m00s [ 71/315] Installing perl-Digest-0:1.20 100% | 36.2 MiB/s | 37.1 KiB | 00m00s [ 72/315] Installing perl-Digest-MD5-0: 100% | 60.1 MiB/s | 61.6 KiB | 00m00s [ 73/315] Installing perl-B-0:1.89-515. 100% | 244.8 MiB/s | 501.3 KiB | 00m00s [ 74/315] Installing perl-FileHandle-0: 100% | 0.0 B/s | 9.8 KiB | 00m00s [ 75/315] Installing perl-Data-Dumper-0 100% | 114.7 MiB/s | 117.5 KiB | 00m00s [ 76/315] Installing perl-libnet-0:3.15 100% | 143.9 MiB/s | 294.7 KiB | 00m00s [ 77/315] Installing perl-MIME-Base32-0 100% | 0.0 B/s | 32.2 KiB | 00m00s [ 78/315] Installing perl-AutoLoader-0: 100% | 0.0 B/s | 20.9 KiB | 00m00s [ 79/315] Installing perl-IO-Socket-IP- 100% | 99.8 MiB/s | 102.2 KiB | 00m00s [ 80/315] Installing perl-URI-0:5.31-2. 100% | 87.8 MiB/s | 269.6 KiB | 00m00s [ 81/315] Installing perl-Text-Tabs+Wra 100% | 0.0 B/s | 23.9 KiB | 00m00s [ 82/315] Installing perl-if-0:0.61.000 100% | 0.0 B/s | 6.2 KiB | 00m00s [ 83/315] Installing perl-locale-0:1.12 100% | 0.0 B/s | 6.9 KiB | 00m00s [ 84/315] Installing perl-Time-Local-2: 100% | 68.9 MiB/s | 70.6 KiB | 00m00s [ 85/315] Installing perl-File-Path-0:2 100% | 0.0 B/s | 64.5 KiB | 00m00s [ 86/315] Installing perl-Pod-Escapes-1 100% | 0.0 B/s | 25.9 KiB | 00m00s [ 87/315] Installing perl-IO-Socket-SSL 100% | 230.3 MiB/s | 707.4 KiB | 00m00s [ 88/315] Installing perl-Net-SSLeay-0: 100% | 271.7 MiB/s | 1.4 MiB | 00m00s [ 89/315] Installing perl-Class-Struct- 100% | 0.0 B/s | 25.9 KiB | 00m00s [ 90/315] Installing perl-Term-ANSIColo 100% | 96.9 MiB/s | 99.2 KiB | 00m00s [ 91/315] Installing perl-POSIX-0:2.20- 100% | 226.7 MiB/s | 232.2 KiB | 00m00s [ 92/315] Installing perl-IPC-Open3-0:1 100% | 0.0 B/s | 23.3 KiB | 00m00s [ 93/315] Installing perl-File-Temp-1:0 100% | 160.2 MiB/s | 164.1 KiB | 00m00s [ 94/315] Installing perl-Term-Cap-0:1. 100% | 0.0 B/s | 30.6 KiB | 00m00s [ 95/315] Installing perl-HTTP-Tiny-0:0 100% | 152.8 MiB/s | 156.4 KiB | 00m00s [ 96/315] Installing perl-Pod-Simple-1: 100% | 278.5 MiB/s | 570.4 KiB | 00m00s [ 97/315] Installing perl-Socket-4:2.03 100% | 119.1 MiB/s | 122.0 KiB | 00m00s [ 98/315] Installing perl-SelectSaver-0 100% | 0.0 B/s | 2.6 KiB | 00m00s [ 99/315] Installing perl-Symbol-0:1.09 100% | 0.0 B/s | 7.2 KiB | 00m00s [100/315] Installing perl-File-stat-0:1 100% | 0.0 B/s | 13.1 KiB | 00m00s [101/315] Installing perl-Pod-Perldoc-0 100% | 11.8 MiB/s | 169.2 KiB | 00m00s [102/315] Installing perl-podlators-1:6 100% | 22.4 MiB/s | 321.4 KiB | 00m00s [103/315] Installing perl-Fcntl-0:1.18- 100% | 0.0 B/s | 50.0 KiB | 00m00s [104/315] Installing perl-Text-ParseWor 100% | 0.0 B/s | 14.6 KiB | 00m00s [105/315] Installing perl-base-0:2.27-5 100% | 0.0 B/s | 12.9 KiB | 00m00s [106/315] Installing perl-mro-0:1.29-51 100% | 41.6 MiB/s | 42.6 KiB | 00m00s [107/315] Installing perl-IO-0:1.55-515 100% | 147.6 MiB/s | 151.1 KiB | 00m00s [108/315] Installing perl-overloading-0 100% | 0.0 B/s | 5.5 KiB | 00m00s [109/315] Installing perl-Pod-Usage-4:2 100% | 6.5 MiB/s | 86.3 KiB | 00m00s [110/315] Installing perl-Getopt-Std-0: 100% | 0.0 B/s | 11.7 KiB | 00m00s [111/315] Installing perl-File-Basename 100% | 0.0 B/s | 14.6 KiB | 00m00s [112/315] Installing perl-Scalar-List-U 100% | 145.1 MiB/s | 148.5 KiB | 00m00s [113/315] Installing perl-Errno-0:1.38- 100% | 0.0 B/s | 8.7 KiB | 00m00s [114/315] Installing perl-MIME-Base64-0 100% | 43.2 MiB/s | 44.3 KiB | 00m00s [115/315] Installing perl-constant-0:1. 100% | 0.0 B/s | 27.4 KiB | 00m00s [116/315] Installing perl-Storable-1:3. 100% | 228.4 MiB/s | 233.9 KiB | 00m00s [117/315] Installing perl-overload-0:1. 100% | 0.0 B/s | 71.9 KiB | 00m00s [118/315] Installing perl-parent-1:0.24 100% | 0.0 B/s | 11.0 KiB | 00m00s [119/315] Installing perl-vars-0:1.05-5 100% | 0.0 B/s | 4.3 KiB | 00m00s [120/315] Installing perl-Getopt-Long-1 100% | 143.8 MiB/s | 147.2 KiB | 00m00s [121/315] Installing perl-Carp-0:1.54-5 100% | 0.0 B/s | 47.7 KiB | 00m00s [122/315] Installing perl-Exporter-0:5. 100% | 0.0 B/s | 55.6 KiB | 00m00s [123/315] Installing perl-PathTools-0:3 100% | 180.2 MiB/s | 184.5 KiB | 00m00s [124/315] Installing perl-DynaLoader-0: 100% | 0.0 B/s | 32.5 KiB | 00m00s [125/315] Installing perl-Encode-4:3.21 100% | 187.8 MiB/s | 4.7 MiB | 00m00s [126/315] Installing perl-libs-4:5.40.1 100% | 275.2 MiB/s | 9.9 MiB | 00m00s [127/315] Installing perl-interpreter-4 100% | 9.0 MiB/s | 119.8 KiB | 00m00s [128/315] Installing perl-File-Find-0:1 100% | 0.0 B/s | 42.5 KiB | 00m00s [129/315] Installing perl-version-9:0.9 100% | 128.5 MiB/s | 131.5 KiB | 00m00s [130/315] Installing perl-File-Copy-0:2 100% | 0.0 B/s | 20.2 KiB | 00m00s [131/315] Installing perl-ExtUtils-Mani 100% | 84.3 MiB/s | 86.3 KiB | 00m00s [132/315] Installing perl-lib-0:0.65-51 100% | 0.0 B/s | 8.9 KiB | 00m00s [133/315] Installing perl-threads-1:2.4 100% | 114.4 MiB/s | 117.1 KiB | 00m00s [134/315] Installing perl-threads-share 100% | 83.8 MiB/s | 85.9 KiB | 00m00s [135/315] Installing perl-ExtUtils-Pars 100% | 28.3 MiB/s | 405.1 KiB | 00m00s [136/315] Installing perl-Compress-Raw- 100% | 161.6 MiB/s | 165.5 KiB | 00m00s [137/315] Installing perl-File-Compare- 100% | 0.0 B/s | 6.1 KiB | 00m00s [138/315] Installing perl-Time-HiRes-4: 100% | 115.0 MiB/s | 117.8 KiB | 00m00s [139/315] Installing perl-CPAN-Meta-Req 100% | 81.5 MiB/s | 83.4 KiB | 00m00s [140/315] Installing perl-Module-CoreLi 100% | 592.3 MiB/s | 1.2 MiB | 00m00s [141/315] Installing perl-Module-Metada 100% | 0.0 B/s | 69.0 KiB | 00m00s [142/315] Installing perl-Digest-SHA-1: 100% | 8.6 MiB/s | 115.0 KiB | 00m00s [143/315] Installing perl-Filter-2:1.64 100% | 81.1 MiB/s | 166.2 KiB | 00m00s [144/315] Installing perl-Module-Load-1 100% | 0.0 B/s | 15.9 KiB | 00m00s [145/315] Installing perl-Perl-OSType-0 100% | 0.0 B/s | 34.3 KiB | 00m00s [146/315] Installing perl-Term-ReadLine 100% | 0.0 B/s | 17.8 KiB | 00m00s [147/315] Installing perl-Tie-0:4.6-515 100% | 6.6 MiB/s | 33.7 KiB | 00m00s [148/315] Installing perl-Unicode-Norma 100% | 228.2 MiB/s | 467.4 KiB | 00m00s [149/315] Installing perl-meta-notation 100% | 0.0 B/s | 2.3 KiB | 00m00s [150/315] Installing perl-encoding-4:3. 100% | 146.9 MiB/s | 150.4 KiB | 00m00s [151/315] Installing perl-Net-Ping-0:2. 100% | 132.2 MiB/s | 135.3 KiB | 00m00s [152/315] Installing perl-ExtUtils-Comm 100% | 0.0 B/s | 10.2 KiB | 00m00s [153/315] Installing perl-Pod-Html-0:1. 100% | 3.3 MiB/s | 43.8 KiB | 00m00s [154/315] Installing perl-File-Which-0: 100% | 0.0 B/s | 31.4 KiB | 00m00s [155/315] Installing perl-AutoSplit-0:5 100% | 0.0 B/s | 23.5 KiB | 00m00s [156/315] Installing perl-Benchmark-0:1 100% | 35.9 MiB/s | 36.7 KiB | 00m00s [157/315] Installing perl-Test-Harness- 100% | 33.5 MiB/s | 582.4 KiB | 00m00s [158/315] Installing perl-ExtUtils-Inst 100% | 85.1 MiB/s | 87.2 KiB | 00m00s [159/315] Installing perl-ExtUtils-Make 100% | 48.5 MiB/s | 744.7 KiB | 00m00s [160/315] Installing perl-CPAN-Meta-YAM 100% | 0.0 B/s | 53.5 KiB | 00m00s [161/315] Installing perl-Compress-Raw- 100% | 68.0 MiB/s | 69.6 KiB | 00m00s [162/315] Installing perl-IO-Compress-0 100% | 64.5 MiB/s | 1.0 MiB | 00m00s [163/315] Installing perl-IO-Zlib-1:1.1 100% | 0.0 B/s | 26.7 KiB | 00m00s [164/315] Installing perl-Devel-PPPort- 100% | 436.7 MiB/s | 894.5 KiB | 00m00s [165/315] Installing perl-DirHandle-0:1 100% | 0.0 B/s | 3.8 KiB | 00m00s [166/315] Installing perl-Dumpvalue-0:2 100% | 0.0 B/s | 20.2 KiB | 00m00s [167/315] Installing perl-ExtUtils-Cons 100% | 85.5 MiB/s | 87.6 KiB | 00m00s [168/315] Installing perl-devel-4:5.40. 100% | 298.2 MiB/s | 8.1 MiB | 00m00s [169/315] Installing perl-ExtUtils-Embe 100% | 0.0 B/s | 16.1 KiB | 00m00s [170/315] Installing perl-ExtUtils-MM-U 100% | 0.0 B/s | 3.7 KiB | 00m00s [171/315] Installing perl-Hash-Util-Fie 100% | 62.8 MiB/s | 64.3 KiB | 00m00s [172/315] Installing perl-Hash-Util-0:0 100% | 55.0 MiB/s | 56.4 KiB | 00m00s [173/315] Installing perl-I18N-LangTags 100% | 81.6 MiB/s | 83.6 KiB | 00m00s [174/315] Installing perl-Locale-Makete 100% | 169.9 MiB/s | 173.9 KiB | 00m00s [175/315] Installing perl-Locale-Makete 100% | 0.0 B/s | 13.5 KiB | 00m00s [176/315] Installing perl-Params-Check- 100% | 0.0 B/s | 28.6 KiB | 00m00s [177/315] Installing perl-Module-Load-C 100% | 0.0 B/s | 29.9 KiB | 00m00s [178/315] Installing perl-IPC-Cmd-2:1.0 100% | 83.9 MiB/s | 85.9 KiB | 00m00s [179/315] Installing perl-ExtUtils-CBui 100% | 99.4 MiB/s | 101.7 KiB | 00m00s [180/315] Installing perl-Math-Complex- 100% | 0.0 B/s | 85.8 KiB | 00m00s [181/315] Installing perl-Math-BigInt-1 100% | 314.6 MiB/s | 966.4 KiB | 00m00s [182/315] Installing perl-JSON-PP-1:4.1 100% | 10.8 MiB/s | 143.6 KiB | 00m00s [183/315] Installing perl-CPAN-Meta-0:2 100% | 149.9 MiB/s | 613.8 KiB | 00m00s [184/315] Installing perl-NDBM_File-0:1 100% | 28.9 MiB/s | 29.6 KiB | 00m00s [185/315] Installing perl-SelfLoader-0: 100% | 0.0 B/s | 22.8 KiB | 00m00s [186/315] Installing perl-Sys-Hostname- 100% | 16.8 MiB/s | 17.2 KiB | 00m00s [187/315] Installing perl-Term-Table-0: 100% | 79.2 MiB/s | 81.1 KiB | 00m00s [188/315] Installing perl-Text-Balanced 100% | 110.1 MiB/s | 112.7 KiB | 00m00s [189/315] Installing perl-Tie-RefHash-0 100% | 0.0 B/s | 37.4 KiB | 00m00s [190/315] Installing perl-User-pwent-0: 100% | 0.0 B/s | 17.9 KiB | 00m00s [191/315] Installing perl-autouse-0:1.1 100% | 0.0 B/s | 6.3 KiB | 00m00s [192/315] Installing perl-subs-0:1.04-5 100% | 0.0 B/s | 2.5 KiB | 00m00s [193/315] Installing perl-Opcode-0:1.65 100% | 48.7 MiB/s | 49.8 KiB | 00m00s [194/315] Installing perl-Safe-0:2.46-5 100% | 0.0 B/s | 31.0 KiB | 00m00s [195/315] Installing perl-Params-Util-0 100% | 59.6 MiB/s | 61.0 KiB | 00m00s [196/315] Installing perl-Sub-Install-0 100% | 0.0 B/s | 37.2 KiB | 00m00s [197/315] Installing perl-Data-OptList- 100% | 51.0 MiB/s | 52.2 KiB | 00m00s [198/315] Installing perl-Filter-Simple 100% | 50.5 MiB/s | 51.7 KiB | 00m00s [199/315] Installing perl-Test-Simple-3 100% | 160.8 MiB/s | 1.8 MiB | 00m00s [200/315] Installing perl-Devel-SelfStu 100% | 0.0 B/s | 7.3 KiB | 00m00s [201/315] Installing perl-Memoize-0:1.1 100% | 65.0 MiB/s | 66.5 KiB | 00m00s [202/315] Installing perl-Math-BigInt-F 100% | 45.7 MiB/s | 46.8 KiB | 00m00s [203/315] Installing perl-bignum-0:0.67 100% | 133.3 MiB/s | 136.5 KiB | 00m00s [204/315] Installing perl-File-Fetch-0: 100% | 0.0 B/s | 60.2 KiB | 00m00s [205/315] Installing perl-fields-0:2.27 100% | 0.0 B/s | 12.2 KiB | 00m00s [206/315] Installing perl-ExtUtils-Mini 100% | 0.0 B/s | 8.8 KiB | 00m00s [207/315] Installing perl-DBM_Filter-0: 100% | 0.0 B/s | 30.5 KiB | 00m00s [208/315] Installing perl-libnetcfg-4:5 100% | 1.4 MiB/s | 17.3 KiB | 00m00s [209/315] Installing perl-inc-latest-2: 100% | 0.0 B/s | 36.3 KiB | 00m00s [210/315] Installing perl-File-HomeDir- 100% | 120.9 MiB/s | 123.8 KiB | 00m00s [211/315] Installing perl-open-0:1.13-5 100% | 0.0 B/s | 11.7 KiB | 00m00s [212/315] Installing perl-debugger-0:1. 100% | 393.8 MiB/s | 403.3 KiB | 00m00s [213/315] Installing perl-sigtrap-0:1.1 100% | 0.0 B/s | 11.4 KiB | 00m00s [214/315] Installing perl-Unicode-Colla 100% | 381.4 MiB/s | 4.2 MiB | 00m00s [215/315] Installing perl-Unicode-UCD-0 100% | 200.2 MiB/s | 205.0 KiB | 00m00s [216/315] Installing perl-Env-0:1.06-51 100% | 0.0 B/s | 27.2 KiB | 00m00s [217/315] Installing perl-Module-CoreLi 100% | 1.4 MiB/s | 19.3 KiB | 00m00s [218/315] Installing perl-Archive-Zip-0 100% | 20.8 MiB/s | 297.8 KiB | 00m00s [219/315] Installing perl-Thread-0:3.05 100% | 0.0 B/s | 12.5 KiB | 00m00s [220/315] Installing perl-Thread-Queue- 100% | 0.0 B/s | 30.4 KiB | 00m00s [221/315] Installing perl-Thread-Semaph 100% | 0.0 B/s | 10.6 KiB | 00m00s [222/315] Installing perl-experimental- 100% | 0.0 B/s | 43.9 KiB | 00m00s [223/315] Installing perl-Encode-devel- 100% | 7.6 MiB/s | 101.1 KiB | 00m00s [224/315] Installing perl-Pod-Checker-4 100% | 4.4 MiB/s | 53.5 KiB | 00m00s [225/315] Installing perl-diagnostics-0 100% | 35.0 MiB/s | 466.5 KiB | 00m00s [226/315] Installing perl-macros-4:5.40 100% | 0.0 B/s | 5.8 KiB | 00m00s [227/315] Installing perl-utils-0:5.40. 100% | 8.0 MiB/s | 98.5 KiB | 00m00s [228/315] Installing perl-Attribute-Han 100% | 0.0 B/s | 40.5 KiB | 00m00s [229/315] Installing perl-Config-Extens 100% | 0.0 B/s | 3.2 KiB | 00m00s [230/315] Installing perl-Config-Perl-V 100% | 0.0 B/s | 27.5 KiB | 00m00s [231/315] Installing perl-Devel-Peek-0: 100% | 0.0 B/s | 44.9 KiB | 00m00s [232/315] Installing perl-English-0:1.1 100% | 0.0 B/s | 6.6 KiB | 00m00s [233/315] Installing perl-File-DosGlob- 100% | 0.0 B/s | 22.2 KiB | 00m00s [234/315] Installing perl-FileCache-0:1 100% | 0.0 B/s | 7.9 KiB | 00m00s [235/315] Installing perl-FindBin-0:1.5 100% | 0.0 B/s | 7.1 KiB | 00m00s [236/315] Installing perl-GDBM_File-1:1 100% | 78.8 MiB/s | 80.7 KiB | 00m00s [237/315] Installing perl-I18N-Collate- 100% | 0.0 B/s | 7.6 KiB | 00m00s [238/315] Installing perl-I18N-Langinfo 100% | 0.0 B/s | 36.1 KiB | 00m00s [239/315] Installing perl-IPC-SysV-0:2. 100% | 74.9 MiB/s | 76.7 KiB | 00m00s [240/315] Installing perl-Module-Loaded 100% | 0.0 B/s | 5.5 KiB | 00m00s [241/315] Installing perl-NEXT-0:0.69-5 100% | 0.0 B/s | 23.9 KiB | 00m00s [242/315] Installing perl-Net-0:1.04-51 100% | 0.0 B/s | 23.7 KiB | 00m00s [243/315] Installing perl-ODBM_File-0:1 100% | 0.0 B/s | 29.5 KiB | 00m00s [244/315] Installing perl-PerlIO-via-Qu 100% | 0.0 B/s | 32.1 KiB | 00m00s [245/315] Installing perl-Pod-Functions 100% | 0.0 B/s | 14.6 KiB | 00m00s [246/315] Installing perl-Search-Dict-0 100% | 0.0 B/s | 5.2 KiB | 00m00s [247/315] Installing perl-Sys-Syslog-0: 100% | 94.6 MiB/s | 96.9 KiB | 00m00s [248/315] Installing perl-Term-Complete 100% | 0.0 B/s | 6.3 KiB | 00m00s [249/315] Installing perl-Test-0:1.31-5 100% | 0.0 B/s | 37.4 KiB | 00m00s [250/315] Installing perl-Text-Abbrev-0 100% | 0.0 B/s | 3.6 KiB | 00m00s [251/315] Installing perl-Tie-File-0:1. 100% | 0.0 B/s | 86.2 KiB | 00m00s [252/315] Installing perl-Tie-Memoize-0 100% | 0.0 B/s | 6.7 KiB | 00m00s [253/315] Installing perl-Time-0:1.04-5 100% | 0.0 B/s | 10.8 KiB | 00m00s [254/315] Installing perl-Time-Piece-0: 100% | 71.0 MiB/s | 72.7 KiB | 00m00s [255/315] Installing perl-blib-0:1.07-5 100% | 0.0 B/s | 3.6 KiB | 00m00s [256/315] Installing perl-deprecate-0:0 100% | 6.8 MiB/s | 6.9 KiB | 00m00s [257/315] Installing perl-doc-0:5.40.1- 100% | 410.0 MiB/s | 11.1 MiB | 00m00s [258/315] Installing perl-encoding-warn 100% | 0.0 B/s | 10.6 KiB | 00m00s [259/315] Installing perl-filetest-0:1. 100% | 0.0 B/s | 6.8 KiB | 00m00s [260/315] Installing perl-less-0:0.03-5 100% | 0.0 B/s | 5.3 KiB | 00m00s [261/315] Installing perl-perlfaq-0:5.2 100% | 359.8 MiB/s | 736.9 KiB | 00m00s [262/315] Installing perl-ph-0:5.40.1-5 100% | 269.0 MiB/s | 275.4 KiB | 00m00s [263/315] Installing perl-sort-0:2.05-5 100% | 0.0 B/s | 5.2 KiB | 00m00s [264/315] Installing perl-vmsish-0:1.04 100% | 0.0 B/s | 6.9 KiB | 00m00s [265/315] Installing perl-Compress-Bzip 100% | 141.9 MiB/s | 145.3 KiB | 00m00s [266/315] Installing perl-Devel-Size-0: 100% | 42.5 MiB/s | 43.5 KiB | 00m00s [267/315] Installing perl-Text-Glob-0:0 100% | 0.0 B/s | 9.3 KiB | 00m00s [268/315] Installing perl-local-lib-0:2 100% | 117.6 MiB/s | 120.4 KiB | 00m00s [269/315] Installing perl-IPC-System-Si 100% | 71.8 MiB/s | 73.5 KiB | 00m00s [270/315] Installing perl-autodie-0:2.3 100% | 214.0 MiB/s | 219.1 KiB | 00m00s [271/315] Installing perl-Compress-Raw- 100% | 120.4 MiB/s | 123.3 KiB | 00m00s [272/315] Installing perl-IO-Compress-L 100% | 215.2 MiB/s | 220.4 KiB | 00m00s [273/315] Installing perl-Algorithm-Dif 100% | 106.9 MiB/s | 109.5 KiB | 00m00s [274/315] Installing perl-Text-Diff-0:1 100% | 83.1 MiB/s | 85.1 KiB | 00m00s [275/315] Installing perl-Archive-Tar-0 100% | 11.7 MiB/s | 156.4 KiB | 00m00s [276/315] Installing perl-Module-Signat 100% | 10.6 MiB/s | 141.7 KiB | 00m00s [277/315] Installing perl-Text-Template 100% | 111.3 MiB/s | 114.0 KiB | 00m00s [278/315] Installing perl-MRO-Compat-0: 100% | 43.8 MiB/s | 44.9 KiB | 00m00s [279/315] Installing perl-Package-Gener 100% | 30.8 MiB/s | 31.5 KiB | 00m00s [280/315] Installing perl-Sub-Exporter- 100% | 197.2 MiB/s | 201.9 KiB | 00m00s [281/315] Installing perl-Data-Section- 100% | 43.0 MiB/s | 44.1 KiB | 00m00s [282/315] Installing perl-Software-Lice 100% | 167.4 MiB/s | 514.4 KiB | 00m00s [283/315] Installing perl-Module-Build- 100% | 43.2 MiB/s | 663.2 KiB | 00m00s [284/315] Installing perl-TermReadKey-0 100% | 64.6 MiB/s | 66.2 KiB | 00m00s [285/315] Installing perl-Error-1:0.170 100% | 78.1 MiB/s | 80.0 KiB | 00m00s [286/315] Installing perl-Git-0:2.48.1- 100% | 0.0 B/s | 65.0 KiB | 00m00s [287/315] Installing git-0:2.48.1-3.fc4 100% | 85.4 MiB/s | 87.5 KiB | 00m00s [288/315] Installing rocm-clang-0:18-37 100% | 83.9 MiB/s | 117.6 MiB | 00m01s [289/315] Installing rocm-clang-devel-0 100% | 116.7 MiB/s | 21.9 MiB | 00m00s [290/315] Installing rocm-device-libs-0 100% | 89.8 MiB/s | 3.2 MiB | 00m00s [291/315] Installing rocm-comgr-devel-0 100% | 101.9 MiB/s | 104.4 KiB | 00m00s [292/315] Installing hipcc-0:18-37.rocm 100% | 35.5 MiB/s | 762.6 KiB | 00m00s [293/315] Installing rocm-hip-0:6.3.1-3 100% | 394.7 MiB/s | 23.3 MiB | 00m00s [294/315] Installing libdb-0:5.3.28-64. 100% | 374.1 MiB/s | 1.9 MiB | 00m00s [295/315] Installing perl-DB_File-0:1.8 100% | 186.1 MiB/s | 190.6 KiB | 00m00s [296/315] Installing perl-CPAN-0:2.38-3 100% | 99.8 MiB/s | 1.9 MiB | 00m00s [297/315] Installing perl-4:5.40.1-515. 100% | 0.0 B/s | 124.0 B | 00m00s [298/315] Installing llvm19-filesystem- 100% | 0.0 B/s | 1.1 KiB | 00m00s [299/315] Installing llvm19-libs-0:19.1 100% | 449.4 MiB/s | 124.0 MiB | 00m00s [300/315] Installing clang19-resource-f 100% | 15.8 MiB/s | 16.2 KiB | 00m00s [301/315] Installing clang19-libs-0:19. 100% | 498.8 MiB/s | 124.2 MiB | 00m00s [302/315] Installing emacs-filesystem-1 100% | 0.0 B/s | 544.0 B | 00m00s [303/315] Installing rhash-0:1.4.5-2.fc 100% | 23.2 MiB/s | 356.4 KiB | 00m00s [304/315] Installing libuv-1:1.50.0-1.f 100% | 278.1 MiB/s | 569.6 KiB | 00m00s [305/315] Installing jsoncpp-0:1.9.5-9. 100% | 29.0 MiB/s | 267.1 KiB | 00m00s [306/315] Installing cmake-data-0:3.31. 100% | 124.1 MiB/s | 9.1 MiB | 00m00s [307/315] Installing cmake-0:3.31.5-1.f 100% | 352.7 MiB/s | 34.2 MiB | 00m00s [308/315] Installing rocm-cmake-0:6.3.0 100% | 131.7 MiB/s | 134.9 KiB | 00m00s [309/315] Installing hipify-0:6.3.0-3.f 100% | 150.7 MiB/s | 2.9 MiB | 00m00s [310/315] Installing rocm-hip-devel-0:6 100% | 149.3 MiB/s | 2.7 MiB | 00m00s [311/315] Installing rocm-rpm-macros-0: 100% | 0.0 B/s | 19.3 KiB | 00m00s [312/315] Installing rocm-smi-devel-0:6 100% | 234.2 MiB/s | 239.8 KiB | 00m00s [313/315] Installing rocm-core-devel-0: 100% | 0.0 B/s | 16.1 KiB | 00m00s [314/315] Installing annobin-plugin-gcc 100% | 74.6 MiB/s | 993.4 KiB | 00m00s [315/315] Installing gcc-plugin-annobin 100% | 320.5 KiB/s | 58.6 KiB | 00m00s Warning: skipped OpenPGP checks for 26 packages from repository: copr_base Complete! Finish: build setup for rccl-6.3.0-3.fc42.src.rpm Start: rpmbuild rccl-6.3.0-3.fc42.src.rpm Building target platforms: x86_64 Building for target x86_64 setting SOURCE_DATE_EPOCH=1737158400 Executing(%mkbuilddir): /bin/sh -e /var/tmp/rpm-tmp.p5YAB2 + umask 022 + cd /builddir/build/BUILD/rccl-6.3.0-build + test -d /builddir/build/BUILD/rccl-6.3.0-build + /usr/bin/chmod -Rf a+rX,u+w,g-w,o-w /builddir/build/BUILD/rccl-6.3.0-build + /usr/bin/rm -rf /builddir/build/BUILD/rccl-6.3.0-build + /usr/bin/mkdir -p /builddir/build/BUILD/rccl-6.3.0-build + /usr/bin/mkdir -p /builddir/build/BUILD/rccl-6.3.0-build/SPECPARTS + RPM_EC=0 ++ jobs -p + exit 0 Executing(%prep): /bin/sh -e /var/tmp/rpm-tmp.5PF7Hx + umask 022 + cd /builddir/build/BUILD/rccl-6.3.0-build + cd /builddir/build/BUILD/rccl-6.3.0-build + rm -rf rccl-rocm-6.3.0 + /usr/lib/rpm/rpmuncompress -x /builddir/build/SOURCES/RCCL-6.3.0.tar.gz + STATUS=0 + '[' 0 -ne 0 ']' + cd rccl-rocm-6.3.0 + /usr/bin/chmod -Rf a+rX,u+w,g-w,o-w . + sed -i -e '/AMD GPU targets to compile for/d' CMakeLists.txt + sed -i -e 's@cat ${ROCM_PATH}/.info/version@echo 6.3.0@' CMakeLists.txt + sed -i -e s@rocm-core/rocm_version.h@rocm_version.h@ src/include/hip_rocm_version_info.h + RPM_EC=0 ++ jobs -p + exit 0 Executing(%build): /bin/sh -e /var/tmp/rpm-tmp.JF887Q + umask 022 + cd /builddir/build/BUILD/rccl-6.3.0-build + CFLAGS='-O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer ' + export CFLAGS + CXXFLAGS='-O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer' + export CXXFLAGS + FFLAGS='-O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -I/usr/lib64/gfortran/modules ' + export FFLAGS + FCFLAGS='-O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -I/usr/lib64/gfortran/modules ' + export FCFLAGS + VALAFLAGS=-g + export VALAFLAGS + RUSTFLAGS='-Copt-level=3 -Cdebuginfo=2 -Ccodegen-units=1 -Cstrip=none -Cforce-frame-pointers=yes -Clink-arg=-specs=/usr/lib/rpm/redhat/redhat-package-notes --cap-lints=warn' + export RUSTFLAGS + LDFLAGS='-Wl,-z,relro -Wl,--as-needed -Wl,-z,pack-relative-relocs -Wl,-z,now -Wl,-z,now -Wl,--build-id=sha1 -specs=/usr/lib/rpm/redhat/redhat-package-notes ' + export LDFLAGS + LT_SYS_LIBRARY_PATH=/usr/lib64: + export LT_SYS_LIBRARY_PATH + CC=hipcc + export CC + CXX=hipcc + export CXX + cd rccl-rocm-6.3.0 + CFLAGS='-O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer ' + export CFLAGS + CXXFLAGS='-O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer' + export CXXFLAGS + FFLAGS='-O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -I/usr/lib64/gfortran/modules ' + export FFLAGS + FCFLAGS='-O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -I/usr/lib64/gfortran/modules ' + export FCFLAGS + VALAFLAGS=-g + export VALAFLAGS + RUSTFLAGS='-Copt-level=3 -Cdebuginfo=2 -Ccodegen-units=1 -Cstrip=none -Cforce-frame-pointers=yes -Clink-arg=-specs=/usr/lib/rpm/redhat/redhat-package-notes --cap-lints=warn' + export RUSTFLAGS + LDFLAGS='-Wl,-z,relro -Wl,--as-needed -Wl,-z,pack-relative-relocs -Wl,-z,now -Wl,-z,now -Wl,--build-id=sha1 -specs=/usr/lib/rpm/redhat/redhat-package-notes ' + export LDFLAGS + LT_SYS_LIBRARY_PATH=/usr/lib64: + export LT_SYS_LIBRARY_PATH + CC=hipcc + export CC + CXX=hipcc + export CXX + /usr/bin/cmake -S . -B redhat-linux-build -DCMAKE_C_FLAGS_RELEASE:STRING=-DNDEBUG -DCMAKE_CXX_FLAGS_RELEASE:STRING=-DNDEBUG -DCMAKE_Fortran_FLAGS_RELEASE:STRING=-DNDEBUG -DCMAKE_VERBOSE_MAKEFILE:BOOL=ON -DCMAKE_INSTALL_DO_STRIP:BOOL=OFF -DCMAKE_INSTALL_PREFIX:PATH=/usr -DCMAKE_INSTALL_FULL_SBINDIR:PATH=/usr/bin -DCMAKE_INSTALL_SBINDIR:PATH=bin -DINCLUDE_INSTALL_DIR:PATH=/usr/include -DLIB_INSTALL_DIR:PATH=/usr/lib64 -DSYSCONF_INSTALL_DIR:PATH=/etc -DSHARE_INSTALL_PREFIX:PATH=/usr/share -DLIB_SUFFIX=64 -DBUILD_SHARED_LIBS:BOOL=ON -DBUILD_TESTS=OFF -DCMAKE_BUILD_TYPE=RelWithDebInfo -DCMAKE_CXX_COMPILER=/usr/bin/hipcc -DCMAKE_C_COMPILER=/usr/bin/hipcc -DCMAKE_EXPORT_COMPILE_COMMANDS=OFF -DCMAKE_SKIP_RPATH=ON -DBUILD_FILE_REORG_BACKWARD_COMPATIBILITY=OFF -DCMAKE_INSTALL_LIBDIR=/usr/lib64 -DROCM_SYMLINK_LIBS=OFF '-DAMDGPU_TARGETS=gfx90a:xnack+;gfx90a:xnack-;gfx1100;gfx1101;gfx1102;gfx1200;gfx1201' -DHIP_PLATFORM=amd -DRCCL_ROCPROFILER_REGISTER=OFF CMake Deprecation Warning at CMakeLists.txt:6 (cmake_minimum_required): Compatibility with CMake < 3.10 will be removed from a future version of CMake. Update the VERSION argument value. Or, use the ... syntax to tell CMake that the project requires at least but has been updated to work with policies introduced by or earlier. -- CMAKE_TOOLCHAIN_FILE: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/toolchain-linux.cmake -- The CXX compiler identification is Clang 18.0.0 -- Detecting CXX compiler ABI info -- Detecting CXX compiler ABI info - done -- Check for working CXX compiler: /usr/bin/hipcc - skipped -- Detecting CXX compile features -- Detecting CXX compile features - done -- Could NOT find GTest (missing: GTEST_LIBRARY GTEST_INCLUDE_DIR GTEST_MAIN_LIBRARY) (Required is at least version "1.11") -- Checking for ROCm support for GPU targets: gfx90a:xnack+;gfx90a:xnack-;gfx1100;gfx1101;gfx1102;gfx1200;gfx1201 -- Performing Test COMPILER_HAS_TARGET_ID_gfx90a_xnack_on -- Performing Test COMPILER_HAS_TARGET_ID_gfx90a_xnack_on - Success -- Performing Test COMPILER_HAS_TARGET_ID_gfx90a_xnack_off -- Performing Test COMPILER_HAS_TARGET_ID_gfx90a_xnack_off - Success -- Performing Test COMPILER_HAS_TARGET_ID_gfx1100 -- Performing Test COMPILER_HAS_TARGET_ID_gfx1100 - Success -- Performing Test COMPILER_HAS_TARGET_ID_gfx1101 -- Performing Test COMPILER_HAS_TARGET_ID_gfx1101 - Success -- Performing Test COMPILER_HAS_TARGET_ID_gfx1102 -- Performing Test COMPILER_HAS_TARGET_ID_gfx1102 - Success -- Performing Test COMPILER_HAS_TARGET_ID_gfx1200 -- Performing Test COMPILER_HAS_TARGET_ID_gfx1200 - Success -- Performing Test COMPILER_HAS_TARGET_ID_gfx1201 -- Performing Test COMPILER_HAS_TARGET_ID_gfx1201 - Success -- Compiling for gfx90a:xnack+;gfx90a:xnack-;gfx1100;gfx1101;gfx1102;gfx1200;gfx1201 -- Could NOT find GTest (missing: GTEST_LIBRARY GTEST_INCLUDE_DIR GTEST_MAIN_LIBRARY) (Required is at least version "1.11") -- ROCM_PATH found: /opt/rocm -- Compiling with hipcc -- Performing Test CMAKE_HAVE_LIBC_PTHREAD -- Performing Test CMAKE_HAVE_LIBC_PTHREAD - Success -- Found Threads: TRUE -- Performing Test HIP_CLANG_SUPPORTS_PARALLEL_JOBS -- Performing Test HIP_CLANG_SUPPORTS_PARALLEL_JOBS - Success -- HIP compiler: clang -- HIP runtime: rocclr -- hipcc executable: /usr/bin/hipcc sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory -- hipcc version: 6.3.42133 -- hipconfig executable: /usr/bin/hipconfig -- hipcc HIP version: 6.3.42133 -- ROCm version: 6.3.0 ******************************************************************************* *------------------------------- ROCMChecks WARNING --------------------------* Options and properties should be set on a cmake target where possible. The variable 'CMAKE_CXX_FLAGS' may be set by the cmake toolchain, either by calling 'cmake -DCMAKE_CXX_FLAGS="-O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer"' or set in a toolchain file and added with 'cmake -DCMAKE_TOOLCHAIN_FILE='. ROCMChecks now calling: CMake Warning at /usr/share/rocmcmakebuildtools/cmake/ROCMChecks.cmake:46 (message): 'CMAKE_CXX_FLAGS' is set at /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/CMakeLists.txt: shown below: Call Stack (most recent call first): /usr/share/cmake/Modules/CheckSymbolExists.cmake:9223372036854775807 (rocm_check_toolchain_var) /usr/share/cmake/Modules/CheckSymbolExists.cmake:87 (string) /usr/share/cmake/Modules/CheckSymbolExists.cmake:71 (__CHECK_SYMBOL_EXISTS_FILTER_FLAGS) CMakeLists.txt:191 (check_symbol_exists) *-----------------------------------------------------------------------------* ******************************************************************************* ******************************************************************************* *------------------------------- ROCMChecks WARNING --------------------------* Options and properties should be set on a cmake target where possible. The variable 'CMAKE_CXX_FLAGS' may be set by the cmake toolchain, either by calling 'cmake -DCMAKE_CXX_FLAGS="-O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer"' or set in a toolchain file and added with 'cmake -DCMAKE_TOOLCHAIN_FILE='. ROCMChecks now calling: CMake Warning at /usr/share/rocmcmakebuildtools/cmake/ROCMChecks.cmake:46 (message): 'CMAKE_CXX_FLAGS' is set at /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/CMakeLists.txt: shown below: Call Stack (most recent call first): /usr/share/cmake/Modules/CheckSymbolExists.cmake:9223372036854775807 (rocm_check_toolchain_var) /usr/share/cmake/Modules/CheckSymbolExists.cmake:88 (string) /usr/share/cmake/Modules/CheckSymbolExists.cmake:71 (__CHECK_SYMBOL_EXISTS_FILTER_FLAGS) CMakeLists.txt:191 (check_symbol_exists) *-----------------------------------------------------------------------------* ******************************************************************************* -- Looking for hipDeviceMallocUncached -- Looking for hipDeviceMallocUncached - found ******************************************************************************* *------------------------------- ROCMChecks WARNING --------------------------* Options and properties should be set on a cmake target where possible. The variable 'CMAKE_CXX_FLAGS' may be set by the cmake toolchain, either by calling 'cmake -DCMAKE_CXX_FLAGS="-O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer"' or set in a toolchain file and added with 'cmake -DCMAKE_TOOLCHAIN_FILE='. ROCMChecks now calling: CMake Warning at /usr/share/rocmcmakebuildtools/cmake/ROCMChecks.cmake:46 (message): 'CMAKE_CXX_FLAGS' is set at /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/CMakeLists.txt: shown below: Call Stack (most recent call first): /usr/share/cmake/Modules/CheckSymbolExists.cmake:9223372036854775807 (rocm_check_toolchain_var) /usr/share/cmake/Modules/CheckSymbolExists.cmake:99 (set) /usr/share/cmake/Modules/CheckSymbolExists.cmake:73 (__CHECK_SYMBOL_EXISTS_RESTORE_FLAGS) CMakeLists.txt:191 (check_symbol_exists) *-----------------------------------------------------------------------------* ******************************************************************************* ******************************************************************************* *------------------------------- ROCMChecks WARNING --------------------------* Options and properties should be set on a cmake target where possible. The variable 'CMAKE_CXX_FLAGS' may be set by the cmake toolchain, either by calling 'cmake -DCMAKE_CXX_FLAGS="-O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer"' or set in a toolchain file and added with 'cmake -DCMAKE_TOOLCHAIN_FILE='. ROCMChecks now calling: CMake Warning at /usr/share/rocmcmakebuildtools/cmake/ROCMChecks.cmake:46 (message): 'CMAKE_CXX_FLAGS' is set at /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/CMakeLists.txt: shown below: Call Stack (most recent call first): /usr/share/cmake/Modules/CheckSymbolExists.cmake:9223372036854775807 (rocm_check_toolchain_var) /usr/share/cmake/Modules/CheckSymbolExists.cmake:87 (string) /usr/share/cmake/Modules/CheckSymbolExists.cmake:71 (__CHECK_SYMBOL_EXISTS_FILTER_FLAGS) CMakeLists.txt:194 (check_symbol_exists) *-----------------------------------------------------------------------------* ******************************************************************************* ******************************************************************************* *------------------------------- ROCMChecks WARNING --------------------------* Options and properties should be set on a cmake target where possible. The variable 'CMAKE_CXX_FLAGS' may be set by the cmake toolchain, either by calling 'cmake -DCMAKE_CXX_FLAGS="-O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer"' or set in a toolchain file and added with 'cmake -DCMAKE_TOOLCHAIN_FILE='. ROCMChecks now calling: CMake Warning at /usr/share/rocmcmakebuildtools/cmake/ROCMChecks.cmake:46 (message): 'CMAKE_CXX_FLAGS' is set at /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/CMakeLists.txt: shown below: Call Stack (most recent call first): /usr/share/cmake/Modules/CheckSymbolExists.cmake:9223372036854775807 (rocm_check_toolchain_var) /usr/share/cmake/Modules/CheckSymbolExists.cmake:88 (string) /usr/share/cmake/Modules/CheckSymbolExists.cmake:71 (__CHECK_SYMBOL_EXISTS_FILTER_FLAGS) CMakeLists.txt:194 (check_symbol_exists) *-----------------------------------------------------------------------------* ******************************************************************************* -- Looking for hipDeviceMallocContiguous -- Looking for hipDeviceMallocContiguous - found -- RCCL LL128 protocol enabled ******************************************************************************* *------------------------------- ROCMChecks WARNING --------------------------* Options and properties should be set on a cmake target where possible. The variable 'CMAKE_CXX_FLAGS' may be set by the cmake toolchain, either by calling 'cmake -DCMAKE_CXX_FLAGS="-O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer"' or set in a toolchain file and added with 'cmake -DCMAKE_TOOLCHAIN_FILE='. ROCMChecks now calling: CMake Warning at /usr/share/rocmcmakebuildtools/cmake/ROCMChecks.cmake:46 (message): 'CMAKE_CXX_FLAGS' is set at /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/CMakeLists.txt: shown below: Call Stack (most recent call first): /usr/share/cmake/Modules/CheckSymbolExists.cmake:9223372036854775807 (rocm_check_toolchain_var) /usr/share/cmake/Modules/CheckSymbolExists.cmake:99 (set) /usr/share/cmake/Modules/CheckSymbolExists.cmake:73 (__CHECK_SYMBOL_EXISTS_RESTORE_FLAGS) CMakeLists.txt:194 (check_symbol_exists) *-----------------------------------------------------------------------------* ******************************************************************************* -- HSA runtime: /usr/include -- Found rocm_smi at /usr/include -- Looking for C++ include /usr/include/rocm_smi/rocm_smi64Config.h -- Looking for C++ include /usr/include/rocm_smi/rocm_smi64Config.h - found -- RSMI_INIT_FLAG_THRAD_ONLY_MUTEX supported -- Performing Test HAVE_KERNARG_PRELOAD -- Performing Test HAVE_KERNARG_PRELOAD - Success -- Kernarg preloading to SGPR enabled CMake Warning at CMakeLists.txt:301 (message): Can only build MSCCL++ for gfx942; disabling MSCCL++ build -- Found Python3: /usr/bin/python3.13 (found version "3.13.2") found components: Interpreter -- Generating /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/device_table.h -- Generating /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/device_table.cpp -- Generating /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/host_table.cpp -- Generating /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp -- Generating /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp -- Generating /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp -- Generating /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp -- Generating /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp -- Generating /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp -- Generating /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp -- Generating /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp -- Generating /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp -- Generating /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp -- Generating /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp -- Generating /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp -- Generating /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp -- Generating /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp -- Generating /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp -- Generating /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp -- Generating /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp -- Generating /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp -- Generating /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp -- Generating /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp -- Generating /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp -- Generating /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp -- Generating /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp -- Generating /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp -- Generating /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp -- Generating /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp -- Generating /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp -- Generating /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp -- Generating /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp -- Generating /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp -- Generating /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp -- Generating /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp -- Generating /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp -- Generating /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp -- Generating /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp -- Generating /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp -- Generating /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp -- Generating /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp -- Generating /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp -- Generating /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp -- Generating /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp -- Generating /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp -- Generating /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp -- Generating /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp -- Generating /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp -- Generating /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp -- Generating /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp -- Generating /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp -- Generating /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp -- Generating /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp -- Generating /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp -- Generating /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp -- Generating /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp -- Generating /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp -- Generating /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp -- Generating /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp -- Generating /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp -- Generating /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp -- Generating /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp -- Generating /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp -- Generating /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp -- Generating /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp -- Generating /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp -- Generating /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp -- Generating /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp -- Generating /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp -- Generating /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp -- Generating /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp -- Generating /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp -- Generating /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp -- Generating /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp -- Generating /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp -- Generating /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp -- Generating /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp -- Generating /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp -- Generating /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp -- Generating /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp -- Generating /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp -- Generating /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp -- Generating /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp -- Generating /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp -- Generating /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp -- Generating /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp -- Generating /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp -- Generating /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp -- Generating /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp -- Generating /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp -- Generating /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp -- Generating /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp -- Generating /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp -- Generating /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp -- Generating /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp -- Generating /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp -- Generating /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp -- Generating /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp -- Generating /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp -- Generating /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp -- Generating /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp -- Generating /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp -- Generating /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp -- Generating /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp -- Generating /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp -- Generating /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp -- Generating /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp -- Generating /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp -- Generating /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp -- Generating /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp -- Generating /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp -- Generating /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp -- Generating /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp -- Generating /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp -- Generating /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp -- Generating /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp -- Generating /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp -- Generating /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp -- Generating /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp -- Generating /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp -- Generating /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp -- Generating /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp -- Generating /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp -- Generating /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp -- Generating /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp -- Generating /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp -- Generating /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp -- Generating /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp -- Generating /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp -- Generating /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp -- Generating /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp -- Generating /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp -- Generating /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp -- Generating /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp -- Generating /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp -- Generating /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp -- Generating /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp -- Generating /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp -- Generating /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp -- Generating /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp -- Generating /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp -- Generating /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp -- Generating /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp -- Generating /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp -- Generating /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp -- Generating /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp -- Generating /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp -- Generating /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp -- Generating /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp -- Generating /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp -- Generating /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp -- Generating /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp -- Generating /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp -- Generating /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp -- Generating /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp -- Generating /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp -- Generating /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp -- Generating /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp -- Generating /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp -- Generating /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp -- Generating /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp -- Generating /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp -- Generating /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp -- Generating /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp -- Generating /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp -- Generating /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp -- Generating /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp -- Generating /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp -- Generating /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp -- HIP_CONTIGUOUS_MEMORY enabled -- HIP_UNCACHED_MEMORY enabled cat: /sys/fs/cgroup/memory/memory.limit_in_bytes: No such file or directory -- Use 1 jobs for linking -- Building shared RCCL library -- rocm-cmake: Set license file to /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/LICENSE.txt. -- Configuring done (18.5s) -- Generating done (0.0s) CMake Warning: Manually-specified variables were not used by the project: CMAKE_CXX_FLAGS_RELEASE CMAKE_C_FLAGS_RELEASE CMAKE_Fortran_FLAGS_RELEASE CMAKE_INSTALL_DO_STRIP LIB_SUFFIX SHARE_INSTALL_PREFIX SYSCONF_INSTALL_DIR -- Build files have been written to: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build + /usr/bin/cmake --build redhat-linux-build -j4 --verbose Change Dir: '/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build' Run Build Command(s): /usr/bin/cmake -E env VERBOSE=1 /usr/bin/gmake -f Makefile -j4 /usr/bin/cmake -S/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0 -B/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build --check-build-system CMakeFiles/Makefile.cmake 0 /usr/bin/cmake -E cmake_progress_start /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/CMakeFiles /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build//CMakeFiles/progress.marks /usr/bin/gmake -f CMakeFiles/Makefile2 all gmake[1]: Entering directory '/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build' /usr/bin/gmake -f CMakeFiles/git_version_check.dir/build.make CMakeFiles/git_version_check.dir/depend gmake[2]: Entering directory '/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build' cd /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build && /usr/bin/cmake -E cmake_depends "Unix Makefiles" /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0 /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0 /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/CMakeFiles/git_version_check.dir/DependInfo.cmake "--color=" gmake[2]: Leaving directory '/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build' /usr/bin/gmake -f CMakeFiles/git_version_check.dir/build.make CMakeFiles/git_version_check.dir/build gmake[2]: Entering directory '/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build' [ 0%] Updating git_version.cpp if necessary /usr/bin/cmake -P /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/cmake/scripts/git_version.cmake -- Updating git_version.cpp gmake[2]: Leaving directory '/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build' [ 0%] Built target git_version_check /usr/bin/gmake -f CMakeFiles/rccl.dir/build.make CMakeFiles/rccl.dir/depend gmake[2]: Entering directory '/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build' [ 1%] Hipifying src/transport/shm.cc -> /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/shm.cc [ 1%] Hipifying src/channel.cc -> /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/channel.cc [ 2%] Hipifying src/collectives.cc -> /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/collectives.cc [ 1%] Hipifying src/bootstrap.cc -> /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/bootstrap.cc mkdir -p /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/src/channel.cc -o /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/channel.cc && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/channel.cc mkdir -p /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/src/transport/shm.cc -o /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/shm.cc && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/shm.cc mkdir -p /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/src/bootstrap.cc -o /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/bootstrap.cc && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/bootstrap.cc mkdir -p /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/src/collectives.cc -o /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/collectives.cc && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/collectives.cc [ 2%] Hipifying src/debug.cc -> /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/debug.cc mkdir -p /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/src/debug.cc -o /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/debug.cc && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/debug.cc [ 2%] Hipifying src/device/all_gather.h -> /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_gather.h mkdir -p /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/src/device/all_gather.h -o /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_gather.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_gather.h [ 3%] Hipifying src/device/all_reduce.h -> /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h mkdir -p /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/src/device/all_reduce.h -o /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h [ 3%] Hipifying src/device/alltoall_pivot.h -> /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/alltoall_pivot.h mkdir -p /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/src/device/alltoall_pivot.h -o /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/alltoall_pivot.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/alltoall_pivot.h [ 3%] Hipifying src/device/broadcast.h -> /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/broadcast.h mkdir -p /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/src/device/broadcast.h -o /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/broadcast.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/broadcast.h Added COLL_UNROLL template argument to /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/alltoall_pivot.h Added COLL_UNROLL template argument to /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_gather.h [ 3%] Hipifying src/device/common.cu -> /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.cu.cpp mkdir -p /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/src/device/common.cu -o /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.cu.cpp && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.cu.cpp [ 3%] Hipifying src/device/common.h -> /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h mkdir -p /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/src/device/common.h -o /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h Added COLL_UNROLL template argument to /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h [ 4%] Hipifying src/device/common_kernel.h -> /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common_kernel.h mkdir -p /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/src/device/common_kernel.h -o /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common_kernel.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common_kernel.h Added COLL_UNROLL template argument to /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/broadcast.h [ 4%] Hipifying src/device/msccl_kernel_impl.h -> /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h mkdir -p /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/src/device/msccl_kernel_impl.h -o /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h [ 4%] Hipifying src/device/network/unpack/unpack.h -> /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack/unpack.h mkdir -p /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/src/device/network/unpack/unpack.h -o /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack/unpack.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack/unpack.h Added COLL_UNROLL template argument to /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h [ 5%] Hipifying src/device/network/unpack/unpack_defs.h -> /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack/unpack_defs.h mkdir -p /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/src/device/network/unpack/unpack_defs.h -o /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack/unpack_defs.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack/unpack_defs.h Added COLL_UNROLL template argument to /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common_kernel.h [ 5%] Hipifying src/device/onerank.cu -> /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/onerank.cu.cpp mkdir -p /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/src/device/onerank.cu -o /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/onerank.cu.cpp && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/onerank.cu.cpp Added COLL_UNROLL template argument to /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h Added COLL_UNROLL template argument to /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack/unpack.h [ 5%] Hipifying src/device/op128.h -> /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/op128.h mkdir -p /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/src/device/op128.h -o /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/op128.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/op128.h [ 5%] Hipifying src/device/primitives.h -> /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h mkdir -p /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/src/device/primitives.h -o /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h Added COLL_UNROLL template argument to /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack/unpack_defs.h [ 6%] Hipifying src/device/prims_ll.h -> /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h mkdir -p /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/src/device/prims_ll.h -o /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h [ 6%] Hipifying src/device/prims_ll128.h -> /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h mkdir -p /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/src/device/prims_ll128.h -o /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h Added COLL_UNROLL template argument to /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h Added COLL_UNROLL template argument to /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/op128.h [ 6%] Hipifying src/device/prims_simple.h -> /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h mkdir -p /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/src/device/prims_simple.h -o /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h [ 6%] Hipifying src/device/reduce.h -> /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h mkdir -p /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/src/device/reduce.h -o /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h Added COLL_UNROLL template argument to /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h [ 6%] Hipifying src/device/reduce_kernel.h -> /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_kernel.h mkdir -p /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/src/device/reduce_kernel.h -o /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_kernel.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_kernel.h Added COLL_UNROLL template argument to /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h [ 7%] Hipifying src/device/reduce_scatter.h -> /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h mkdir -p /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/src/device/reduce_scatter.h -o /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h Added COLL_UNROLL template argument to /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h [ 7%] Hipifying src/device/sendrecv.h -> /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/sendrecv.h mkdir -p /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/src/device/sendrecv.h -o /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/sendrecv.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/sendrecv.h Added COLL_UNROLL template argument to /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h [ 7%] Hipifying src/enqueue.cc -> /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/enqueue.cc mkdir -p /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/src/enqueue.cc -o /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/enqueue.cc && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/enqueue.cc Added COLL_UNROLL template argument to /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h Added COLL_UNROLL template argument to /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_kernel.h [ 7%] Hipifying src/graph/connect.cc -> /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/connect.cc mkdir -p /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/src/graph/connect.cc -o /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/connect.cc && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/connect.cc [ 7%] Hipifying src/graph/paths.cc -> /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/paths.cc mkdir -p /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/src/graph/paths.cc -o /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/paths.cc && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/paths.cc Added COLL_UNROLL template argument to /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/sendrecv.h [ 8%] Hipifying src/graph/rings.cc -> /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rings.cc mkdir -p /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/src/graph/rings.cc -o /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rings.cc && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rings.cc [ 8%] Hipifying src/graph/rings.h -> /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rings.h mkdir -p /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/src/graph/rings.h -o /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rings.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rings.h [ 8%] Hipifying src/graph/rome_models.cc -> /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc mkdir -p /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/src/graph/rome_models.cc -o /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc [ 8%] Hipifying src/graph/rome_models.h -> /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.h mkdir -p /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/src/graph/rome_models.h -o /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.h [ 9%] Hipifying src/graph/search.cc -> /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/search.cc mkdir -p /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/src/graph/search.cc -o /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/search.cc && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/search.cc [ 9%] Hipifying src/graph/topo.cc -> /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.cc mkdir -p /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/src/graph/topo.cc -o /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.cc && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.cc [ 9%] Hipifying src/graph/topo.h -> /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h mkdir -p /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/src/graph/topo.h -o /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h [ 9%] Hipifying src/graph/trees.cc -> /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/trees.cc mkdir -p /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/src/graph/trees.cc -o /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/trees.cc && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/trees.cc [ 10%] Hipifying src/graph/tuning.cc -> /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/tuning.cc mkdir -p /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/src/graph/tuning.cc -o /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/tuning.cc && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/tuning.cc [ 10%] Hipifying src/graph/xml.cc -> /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.cc mkdir -p /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/src/graph/xml.cc -o /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.cc && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.cc [ 10%] Hipifying src/graph/xml.h -> /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h mkdir -p /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/src/graph/xml.h -o /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h [ 10%] Hipifying src/group.cc -> /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/group.cc mkdir -p /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/src/group.cc -o /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/group.cc && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/group.cc [ 10%] Hipifying src/include/BfdBacktrace.hpp -> /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/BfdBacktrace.hpp mkdir -p /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/src/include/BfdBacktrace.hpp -o /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/BfdBacktrace.hpp && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/BfdBacktrace.hpp [ 10%] Hipifying src/include/align.h -> /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/align.h mkdir -p /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/src/include/align.h -o /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/align.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/align.h [ 11%] Hipifying src/include/alloc.h -> /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h mkdir -p /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/src/include/alloc.h -o /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h [ 11%] Hipifying src/include/alt_rsmi.h -> /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alt_rsmi.h mkdir -p /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/src/include/alt_rsmi.h -o /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alt_rsmi.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alt_rsmi.h [ 11%] Hipifying src/include/api_trace.h -> /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/api_trace.h mkdir -p /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/src/include/api_trace.h -o /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/api_trace.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/api_trace.h [ 11%] Hipifying src/include/archinfo.h -> /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/archinfo.h mkdir -p /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/src/include/archinfo.h -o /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/archinfo.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/archinfo.h [ 12%] Hipifying src/include/argcheck.h -> /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/argcheck.h mkdir -p /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/src/include/argcheck.h -o /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/argcheck.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/argcheck.h [ 12%] Hipifying src/include/bootstrap.h -> /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/bootstrap.h mkdir -p /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/src/include/bootstrap.h -o /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/bootstrap.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/bootstrap.h [ 12%] Hipifying src/include/channel.h -> /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/channel.h mkdir -p /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/src/include/channel.h -o /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/channel.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/channel.h [ 13%] Hipifying src/include/checks.h -> /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/checks.h mkdir -p /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/src/include/checks.h -o /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/checks.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/checks.h [ 13%] Hipifying src/include/coll_net.h -> /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h mkdir -p /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/src/include/coll_net.h -o /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h [ 13%] Hipifying src/include/collectives.h -> /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/collectives.h mkdir -p /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/src/include/collectives.h -o /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/collectives.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/collectives.h [ 13%] Hipifying src/include/comm.h -> /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h mkdir -p /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/src/include/comm.h -o /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h [ 14%] Hipifying src/include/core.h -> /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h mkdir -p /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/src/include/core.h -o /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h [ 14%] Hipifying src/include/cpuset.h -> /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/cpuset.h mkdir -p /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/src/include/cpuset.h -o /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/cpuset.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/cpuset.h [ 14%] Hipifying src/include/debug.h -> /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/debug.h mkdir -p /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/src/include/debug.h -o /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/debug.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/debug.h [ 14%] Hipifying src/include/device.h -> /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h mkdir -p /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/src/include/device.h -o /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h [ 15%] Hipifying src/include/enqueue.h -> /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/enqueue.h mkdir -p /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/src/include/enqueue.h -o /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/enqueue.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/enqueue.h [ 15%] Hipifying src/include/gdrwrap.h -> /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/gdrwrap.h mkdir -p /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/src/include/gdrwrap.h -o /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/gdrwrap.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/gdrwrap.h [ 15%] Hipifying src/include/git_version.h -> /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/git_version.h mkdir -p /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/src/include/git_version.h -o /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/git_version.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/git_version.h [ 15%] Hipifying src/include/graph.h -> /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h mkdir -p /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/src/include/graph.h -o /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h [ 16%] Hipifying src/include/group.h -> /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/group.h mkdir -p /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/src/include/group.h -o /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/group.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/group.h [ 16%] Hipifying src/include/hip_rocm_version_info.h -> /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/hip_rocm_version_info.h mkdir -p /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/src/include/hip_rocm_version_info.h -o /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/hip_rocm_version_info.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/hip_rocm_version_info.h [ 16%] Hipifying src/include/ibvcore.h -> /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/ibvcore.h mkdir -p /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/src/include/ibvcore.h -o /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/ibvcore.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/ibvcore.h [ 16%] Hipifying src/include/ibvsymbols.h -> /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/ibvsymbols.h mkdir -p /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/src/include/ibvsymbols.h -o /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/ibvsymbols.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/ibvsymbols.h [ 17%] Hipifying src/include/ibvwrap.h -> /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/ibvwrap.h mkdir -p /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/src/include/ibvwrap.h -o /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/ibvwrap.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/ibvwrap.h [ 17%] Hipifying src/include/info.h -> /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h mkdir -p /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/src/include/info.h -o /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h [ 17%] Hipifying src/include/ipcsocket.h -> /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/ipcsocket.h mkdir -p /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/src/include/ipcsocket.h -o /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/ipcsocket.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/ipcsocket.h [ 17%] Hipifying src/include/msccl/msccl_kernel.h -> /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/msccl/msccl_kernel.h mkdir -p /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/msccl && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/src/include/msccl/msccl_kernel.h -o /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/msccl/msccl_kernel.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/msccl/msccl_kernel.h [ 17%] Hipifying src/include/msccl/msccl_lifecycle.h -> /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/msccl/msccl_lifecycle.h mkdir -p /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/msccl && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/src/include/msccl/msccl_lifecycle.h -o /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/msccl/msccl_lifecycle.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/msccl/msccl_lifecycle.h [ 18%] Hipifying src/include/msccl/msccl_parser.h -> /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h mkdir -p /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/msccl && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/src/include/msccl/msccl_parser.h -o /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h [ 18%] Hipifying src/include/msccl/msccl_scheduler.h -> /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/msccl/msccl_scheduler.h mkdir -p /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/msccl && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/src/include/msccl/msccl_scheduler.h -o /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/msccl/msccl_scheduler.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/msccl/msccl_scheduler.h [ 18%] Hipifying src/include/msccl/msccl_setup.h -> /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/msccl/msccl_setup.h mkdir -p /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/msccl && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/src/include/msccl/msccl_setup.h -o /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/msccl/msccl_setup.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/msccl/msccl_setup.h [ 18%] Hipifying src/include/msccl/msccl_status.h -> /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/msccl/msccl_status.h mkdir -p /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/msccl && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/src/include/msccl/msccl_status.h -o /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/msccl/msccl_status.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/msccl/msccl_status.h [ 19%] Hipifying src/include/msccl/msccl_struct.h -> /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/msccl/msccl_struct.h mkdir -p /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/msccl && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/src/include/msccl/msccl_struct.h -o /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/msccl/msccl_struct.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/msccl/msccl_struct.h [ 19%] Hipifying src/include/nccl_common.h -> /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/nccl_common.h mkdir -p /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/src/include/nccl_common.h -o /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/nccl_common.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/nccl_common.h [ 20%] Hipifying src/include/nccl_net.h -> /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/nccl_net.h mkdir -p /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/src/include/nccl_net.h -o /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/nccl_net.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/nccl_net.h [ 20%] Hipifying src/include/nccl_tuner.h -> /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/nccl_tuner.h mkdir -p /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/src/include/nccl_tuner.h -o /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/nccl_tuner.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/nccl_tuner.h [ 20%] Hipifying src/include/net.h -> /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/net.h mkdir -p /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/src/include/net.h -o /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/net.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/net.h [ 20%] Hipifying src/include/net_device.h -> /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/net_device.h mkdir -p /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/src/include/net_device.h -o /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/net_device.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/net_device.h [ 20%] Hipifying src/include/npkit/npkit.h -> /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/npkit/npkit.h mkdir -p /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/npkit && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/src/include/npkit/npkit.h -o /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/npkit/npkit.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/npkit/npkit.h [ 20%] Hipifying src/include/npkit/npkit_event.h -> /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/npkit/npkit_event.h mkdir -p /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/npkit && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/src/include/npkit/npkit_event.h -o /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/npkit/npkit_event.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/npkit/npkit_event.h [ 20%] Hipifying src/include/npkit/npkit_struct.h -> /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/npkit/npkit_struct.h mkdir -p /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/npkit && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/src/include/npkit/npkit_struct.h -o /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/npkit/npkit_struct.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/npkit/npkit_struct.h [ 21%] Hipifying src/include/nvmlwrap.h -> /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/nvmlwrap.h mkdir -p /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/src/include/nvmlwrap.h -o /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/nvmlwrap.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/nvmlwrap.h [ 21%] Hipifying src/include/nvtx.h -> /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/nvtx.h mkdir -p /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/src/include/nvtx.h -o /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/nvtx.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/nvtx.h [ 21%] Hipifying src/include/nvtx3/nvToolsExt.h -> /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/nvtx3/nvToolsExt.h mkdir -p /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/nvtx3 && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/src/include/nvtx3/nvToolsExt.h -o /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/nvtx3/nvToolsExt.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/nvtx3/nvToolsExt.h [ 22%] Hipifying src/include/nvtx3/nvToolsExtCuda.h -> /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/nvtx3/nvToolsExtCuda.h mkdir -p /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/nvtx3 && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/src/include/nvtx3/nvToolsExtCuda.h -o /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/nvtx3/nvToolsExtCuda.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/nvtx3/nvToolsExtCuda.h [ 22%] Hipifying src/include/nvtx3/nvToolsExtCudaRt.h -> /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/nvtx3/nvToolsExtCudaRt.h mkdir -p /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/nvtx3 && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/src/include/nvtx3/nvToolsExtCudaRt.h -o /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/nvtx3/nvToolsExtCudaRt.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/nvtx3/nvToolsExtCudaRt.h [ 22%] Hipifying src/include/nvtx3/nvToolsExtOpenCL.h -> /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/nvtx3/nvToolsExtOpenCL.h mkdir -p /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/nvtx3 && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/src/include/nvtx3/nvToolsExtOpenCL.h -o /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/nvtx3/nvToolsExtOpenCL.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/nvtx3/nvToolsExtOpenCL.h [ 23%] Hipifying src/include/nvtx3/nvToolsExtPayload.h -> /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/nvtx3/nvToolsExtPayload.h mkdir -p /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/nvtx3 && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/src/include/nvtx3/nvToolsExtPayload.h -o /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/nvtx3/nvToolsExtPayload.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/nvtx3/nvToolsExtPayload.h [ 23%] Hipifying src/include/nvtx3/nvToolsExtSync.h -> /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/nvtx3/nvToolsExtSync.h mkdir -p /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/nvtx3 && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/src/include/nvtx3/nvToolsExtSync.h -o /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/nvtx3/nvToolsExtSync.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/nvtx3/nvToolsExtSync.h [ 23%] Hipifying src/include/nvtx3/nvtx3.hpp -> /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/nvtx3/nvtx3.hpp mkdir -p /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/nvtx3 && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/src/include/nvtx3/nvtx3.hpp -o /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/nvtx3/nvtx3.hpp && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/nvtx3/nvtx3.hpp [ 23%] Hipifying src/include/nvtx3/nvtxDetail/nvtxImpl.h -> /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxImpl.h mkdir -p /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/src/include/nvtx3/nvtxDetail/nvtxImpl.h -o /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxImpl.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxImpl.h [ 23%] Hipifying src/include/nvtx3/nvtxDetail/nvtxImplCore.h -> /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxImplCore.h mkdir -p /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/src/include/nvtx3/nvtxDetail/nvtxImplCore.h -o /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxImplCore.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxImplCore.h [ 24%] Hipifying src/include/nvtx3/nvtxDetail/nvtxImplCudaRt_v3.h -> /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxImplCudaRt_v3.h mkdir -p /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/src/include/nvtx3/nvtxDetail/nvtxImplCudaRt_v3.h -o /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxImplCudaRt_v3.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxImplCudaRt_v3.h [ 24%] Hipifying src/include/nvtx3/nvtxDetail/nvtxImplCuda_v3.h -> /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxImplCuda_v3.h mkdir -p /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/src/include/nvtx3/nvtxDetail/nvtxImplCuda_v3.h -o /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxImplCuda_v3.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxImplCuda_v3.h [ 24%] Hipifying src/include/nvtx3/nvtxDetail/nvtxImplOpenCL_v3.h -> /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxImplOpenCL_v3.h mkdir -p /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/src/include/nvtx3/nvtxDetail/nvtxImplOpenCL_v3.h -o /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxImplOpenCL_v3.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxImplOpenCL_v3.h [ 25%] Hipifying src/include/nvtx3/nvtxDetail/nvtxImplSync_v3.h -> /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxImplSync_v3.h mkdir -p /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/src/include/nvtx3/nvtxDetail/nvtxImplSync_v3.h -o /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxImplSync_v3.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxImplSync_v3.h [ 25%] Hipifying src/include/nvtx3/nvtxDetail/nvtxInit.h -> /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxInit.h mkdir -p /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/src/include/nvtx3/nvtxDetail/nvtxInit.h -o /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxInit.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxInit.h [ 25%] Hipifying src/include/nvtx3/nvtxDetail/nvtxInitDecls.h -> /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxInitDecls.h mkdir -p /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/src/include/nvtx3/nvtxDetail/nvtxInitDecls.h -o /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxInitDecls.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxInitDecls.h [ 25%] Hipifying src/include/nvtx3/nvtxDetail/nvtxInitDefs.h -> /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxInitDefs.h mkdir -p /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/src/include/nvtx3/nvtxDetail/nvtxInitDefs.h -o /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxInitDefs.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxInitDefs.h [ 26%] Hipifying src/include/nvtx3/nvtxDetail/nvtxLinkOnce.h -> /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxLinkOnce.h mkdir -p /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/src/include/nvtx3/nvtxDetail/nvtxLinkOnce.h -o /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxLinkOnce.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxLinkOnce.h [ 26%] Hipifying src/include/nvtx3/nvtxDetail/nvtxTypes.h -> /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxTypes.h mkdir -p /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/src/include/nvtx3/nvtxDetail/nvtxTypes.h -o /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxTypes.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxTypes.h [ 26%] Hipifying src/include/nvtx3/nvtxExtDetail/nvtxExtImpl.h -> /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/nvtx3/nvtxExtDetail/nvtxExtImpl.h mkdir -p /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/nvtx3/nvtxExtDetail && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/src/include/nvtx3/nvtxExtDetail/nvtxExtImpl.h -o /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/nvtx3/nvtxExtDetail/nvtxExtImpl.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/nvtx3/nvtxExtDetail/nvtxExtImpl.h [ 26%] Hipifying src/include/nvtx3/nvtxExtDetail/nvtxExtImplPayload_v1.h -> /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/nvtx3/nvtxExtDetail/nvtxExtImplPayload_v1.h mkdir -p /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/nvtx3/nvtxExtDetail && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/src/include/nvtx3/nvtxExtDetail/nvtxExtImplPayload_v1.h -o /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/nvtx3/nvtxExtDetail/nvtxExtImplPayload_v1.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/nvtx3/nvtxExtDetail/nvtxExtImplPayload_v1.h [ 27%] Hipifying src/include/nvtx3/nvtxExtDetail/nvtxExtInit.h -> /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/nvtx3/nvtxExtDetail/nvtxExtInit.h mkdir -p /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/nvtx3/nvtxExtDetail && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/src/include/nvtx3/nvtxExtDetail/nvtxExtInit.h -o /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/nvtx3/nvtxExtDetail/nvtxExtInit.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/nvtx3/nvtxExtDetail/nvtxExtInit.h [ 27%] Hipifying src/include/nvtx3/nvtxExtDetail/nvtxExtPayloadTypeInfo.h -> /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/nvtx3/nvtxExtDetail/nvtxExtPayloadTypeInfo.h mkdir -p /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/nvtx3/nvtxExtDetail && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/src/include/nvtx3/nvtxExtDetail/nvtxExtPayloadTypeInfo.h -o /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/nvtx3/nvtxExtDetail/nvtxExtPayloadTypeInfo.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/nvtx3/nvtxExtDetail/nvtxExtPayloadTypeInfo.h [ 27%] Hipifying src/include/nvtx3/nvtxExtDetail/nvtxExtTypes.h -> /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/nvtx3/nvtxExtDetail/nvtxExtTypes.h mkdir -p /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/nvtx3/nvtxExtDetail && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/src/include/nvtx3/nvtxExtDetail/nvtxExtTypes.h -o /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/nvtx3/nvtxExtDetail/nvtxExtTypes.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/nvtx3/nvtxExtDetail/nvtxExtTypes.h [ 27%] Hipifying src/include/nvtx_stub.h -> /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/nvtx_stub.h mkdir -p /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/src/include/nvtx_stub.h -o /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/nvtx_stub.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/nvtx_stub.h [ 27%] Hipifying src/include/p2p.h -> /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/p2p.h mkdir -p /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/src/include/p2p.h -o /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/p2p.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/p2p.h [ 28%] Hipifying src/include/param.h -> /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/param.h mkdir -p /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/src/include/param.h -o /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/param.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/param.h [ 28%] Hipifying src/include/profiler.h -> /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/profiler.h mkdir -p /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/src/include/profiler.h -o /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/profiler.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/profiler.h [ 28%] Hipifying src/include/proxy.h -> /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/proxy.h mkdir -p /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/src/include/proxy.h -o /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/proxy.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/proxy.h [ 28%] Hipifying src/include/rccl_float8.h -> /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h mkdir -p /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/src/include/rccl_float8.h -o /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h [ 28%] Hipifying src/include/rccl_vars.h -> /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_vars.h mkdir -p /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/src/include/rccl_vars.h -o /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_vars.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_vars.h [ 29%] Hipifying src/include/register.h -> /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/register.h mkdir -p /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/src/include/register.h -o /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/register.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/register.h [ 29%] Hipifying src/include/rocm_smi_wrap.h -> /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rocm_smi_wrap.h mkdir -p /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/src/include/rocm_smi_wrap.h -o /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rocm_smi_wrap.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rocm_smi_wrap.h [ 29%] Hipifying src/include/rocmwrap.h -> /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rocmwrap.h mkdir -p /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/src/include/rocmwrap.h -o /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rocmwrap.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rocmwrap.h [ 30%] Hipifying src/include/roctx.h -> /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/roctx.h mkdir -p /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/src/include/roctx.h -o /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/roctx.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/roctx.h [ 30%] Hipifying src/include/shm.h -> /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/shm.h mkdir -p /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/src/include/shm.h -o /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/shm.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/shm.h [ 30%] Hipifying src/include/signals.h -> /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/signals.h mkdir -p /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/src/include/signals.h -o /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/signals.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/signals.h [ 30%] Hipifying src/include/socket.h -> /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/socket.h mkdir -p /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/src/include/socket.h -o /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/socket.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/socket.h [ 31%] Hipifying src/include/strongstream.h -> /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/strongstream.h mkdir -p /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/src/include/strongstream.h -o /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/strongstream.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/strongstream.h [ 31%] Hipifying src/include/timer.h -> /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/timer.h mkdir -p /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/src/include/timer.h -o /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/timer.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/timer.h [ 31%] Hipifying src/include/transport.h -> /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h mkdir -p /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/src/include/transport.h -o /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h [ 32%] Hipifying src/include/trees.h -> /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/trees.h mkdir -p /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/src/include/trees.h -o /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/trees.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/trees.h [ 32%] Hipifying src/include/tuner.h -> /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/tuner.h [ 32%] Hipifying src/include/utils.h -> /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h mkdir -p /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/src/include/tuner.h -o /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/tuner.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/tuner.h mkdir -p /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/src/include/utils.h -o /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h [ 33%] Hipifying src/init.cc -> /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/init.cc mkdir -p /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/src/init.cc -o /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/init.cc && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/init.cc [ 33%] Hipifying src/init_nvtx.cc -> /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/init_nvtx.cc mkdir -p /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/src/init_nvtx.cc -o /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/init_nvtx.cc && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/init_nvtx.cc [ 34%] Hipifying src/misc/alt_rsmi.cc -> /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/alt_rsmi.cc mkdir -p /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/src/misc/alt_rsmi.cc -o /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/alt_rsmi.cc && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/alt_rsmi.cc [ 34%] Hipifying src/misc/api_trace.c -> /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/api_trace.c mkdir -p /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/src/misc/api_trace.c -o /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/api_trace.c && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/api_trace.c [ 35%] Hipifying src/misc/api_trace.cc -> /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/api_trace.cc mkdir -p /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/src/misc/api_trace.cc -o /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/api_trace.cc && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/api_trace.cc [ 35%] Hipifying src/misc/archinfo.cc -> /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/archinfo.cc mkdir -p /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/src/misc/archinfo.cc -o /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/archinfo.cc && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/archinfo.cc [ 35%] Hipifying src/misc/argcheck.cc -> /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/argcheck.cc mkdir -p /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/src/misc/argcheck.cc -o /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/argcheck.cc && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/argcheck.cc [ 35%] Hipifying src/misc/ibvsymbols.cc -> /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/ibvsymbols.cc mkdir -p /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/src/misc/ibvsymbols.cc -o /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/ibvsymbols.cc && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/ibvsymbols.cc [ 35%] Hipifying src/misc/ibvwrap.cc -> /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/ibvwrap.cc mkdir -p /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/src/misc/ibvwrap.cc -o /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/ibvwrap.cc && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/ibvwrap.cc [ 35%] Hipifying src/misc/ipcsocket.cc -> /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/ipcsocket.cc mkdir -p /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/src/misc/ipcsocket.cc -o /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/ipcsocket.cc && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/ipcsocket.cc [ 35%] Hipifying src/misc/msccl/msccl_lifecycle.cc -> /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc mkdir -p /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/msccl && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/src/misc/msccl/msccl_lifecycle.cc -o /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc [ 35%] Hipifying src/misc/msccl/msccl_parser.cc -> /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/msccl/msccl_parser.cc mkdir -p /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/msccl && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/src/misc/msccl/msccl_parser.cc -o /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/msccl/msccl_parser.cc && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/msccl/msccl_parser.cc [ 36%] Hipifying src/misc/msccl/msccl_setup.cc -> /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/msccl/msccl_setup.cc mkdir -p /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/msccl && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/src/misc/msccl/msccl_setup.cc -o /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/msccl/msccl_setup.cc && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/msccl/msccl_setup.cc [ 36%] Hipifying src/misc/msccl/msccl_status.cc -> /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/msccl/msccl_status.cc mkdir -p /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/msccl && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/src/misc/msccl/msccl_status.cc -o /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/msccl/msccl_status.cc && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/msccl/msccl_status.cc [ 37%] Hipifying src/misc/npkit.cc -> /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/npkit.cc mkdir -p /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/src/misc/npkit.cc -o /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/npkit.cc && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/npkit.cc [ 37%] Hipifying src/misc/nvmlwrap_stub.cc -> /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/nvmlwrap_stub.cc mkdir -p /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/src/misc/nvmlwrap_stub.cc -o /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/nvmlwrap_stub.cc && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/nvmlwrap_stub.cc [ 37%] Hipifying src/misc/param.cc -> /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/param.cc mkdir -p /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/src/misc/param.cc -o /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/param.cc && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/param.cc [ 37%] Hipifying src/misc/profiler.cc -> /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/profiler.cc mkdir -p /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/src/misc/profiler.cc -o /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/profiler.cc && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/profiler.cc [ 38%] Hipifying src/misc/rocm_smi_wrap.cc -> /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/rocm_smi_wrap.cc mkdir -p /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/src/misc/rocm_smi_wrap.cc -o /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/rocm_smi_wrap.cc && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/rocm_smi_wrap.cc [ 38%] Hipifying src/misc/rocmwrap.cc -> /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/rocmwrap.cc mkdir -p /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/src/misc/rocmwrap.cc -o /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/rocmwrap.cc && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/rocmwrap.cc [ 38%] Hipifying src/misc/roctx.cc -> /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/roctx.cc mkdir -p /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/src/misc/roctx.cc -o /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/roctx.cc && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/roctx.cc [ 39%] Hipifying src/misc/shmutils.cc -> /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/shmutils.cc mkdir -p /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/src/misc/shmutils.cc -o /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/shmutils.cc && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/shmutils.cc [ 39%] Hipifying src/misc/signals.cc -> /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/signals.cc mkdir -p /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/src/misc/signals.cc -o /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/signals.cc && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/signals.cc [ 39%] Hipifying src/misc/socket.cc -> /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/socket.cc mkdir -p /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/src/misc/socket.cc -o /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/socket.cc && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/socket.cc [ 39%] Hipifying src/misc/strongstream.cc -> /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/strongstream.cc mkdir -p /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/src/misc/strongstream.cc -o /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/strongstream.cc && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/strongstream.cc [ 40%] Hipifying src/misc/tuner.cc -> /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/tuner.cc mkdir -p /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/src/misc/tuner.cc -o /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/tuner.cc && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/tuner.cc [ 40%] Hipifying src/misc/utils.cc -> /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/utils.cc mkdir -p /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/src/misc/utils.cc -o /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/utils.cc && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/utils.cc [ 40%] Hipifying src/net.cc -> /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/net.cc [ 40%] Hipifying src/msccl.cc -> /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/msccl.cc mkdir -p /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/src/msccl.cc -o /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/msccl.cc && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/msccl.cc mkdir -p /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/src/net.cc -o /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/net.cc && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/net.cc [ 41%] Hipifying src/proxy.cc -> /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/proxy.cc mkdir -p /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/src/proxy.cc -o /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/proxy.cc && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/proxy.cc [ 41%] Hipifying src/register.cc -> /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/register.cc mkdir -p /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/src/register.cc -o /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/register.cc && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/register.cc [ 41%] Hipifying src/transport.cc -> /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport.cc mkdir -p /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/src/transport.cc -o /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport.cc && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport.cc [ 41%] Hipifying src/transport/coll_net.cc -> /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/coll_net.cc mkdir -p /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/src/transport/coll_net.cc -o /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/coll_net.cc && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/coll_net.cc [ 42%] Hipifying src/transport/net_ib.cc -> /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/net_ib.cc mkdir -p /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/src/transport/net_ib.cc -o /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/net_ib.cc && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/net_ib.cc [ 42%] Hipifying src/transport/net_socket.cc -> /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/net_socket.cc mkdir -p /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/src/transport/net_socket.cc -o /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/net_socket.cc && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/net_socket.cc [ 42%] Hipifying src/transport/net.cc -> /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/net_tmp.cc mkdir -p /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/src/transport/net.cc -o /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/net_tmp.cc && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/net_tmp.cc [ 42%] Hipifying src/transport/nvls.cc -> /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/nvls.cc mkdir -p /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/src/transport/nvls.cc -o /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/nvls.cc && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/nvls.cc [ 42%] Hipifying src/transport/p2p.cc -> /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/p2p.cc mkdir -p /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/src/transport/p2p.cc -o /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/p2p.cc && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/p2p.cc cd /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build && /usr/bin/cmake -E cmake_depends "Unix Makefiles" /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0 /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0 /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/CMakeFiles/rccl.dir/DependInfo.cmake "--color=" gmake[2]: Leaving directory '/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build' /usr/bin/gmake -f CMakeFiles/rccl.dir/build.make CMakeFiles/rccl.dir/build gmake[2]: Entering directory '/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build' [ 42%] Building CXX object CMakeFiles/rccl.dir/hipify/src/bootstrap.cc.o [ 43%] Building CXX object CMakeFiles/rccl.dir/hipify/src/channel.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60300 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/src/bootstrap.cc.o -MF CMakeFiles/rccl.dir/hipify/src/bootstrap.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/bootstrap.cc.o -c /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/bootstrap.cc /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60300 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/src/channel.cc.o -MF CMakeFiles/rccl.dir/hipify/src/channel.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/channel.cc.o -c /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/channel.cc [ 43%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60300 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives.cc.o -MF CMakeFiles/rccl.dir/hipify/src/collectives.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives.cc.o -c /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/collectives.cc [ 43%] Building CXX object CMakeFiles/rccl.dir/hipify/src/debug.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60300 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/src/debug.cc.o -MF CMakeFiles/rccl.dir/hipify/src/debug.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/debug.cc.o -c /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/debug.cc In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/channel.cc:7: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/channel.cc:7: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/channel.cc:7: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/channel.cc:7: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/channel.cc:7: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/channel.cc:7: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/channel.cc:7: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/collectives.cc:7: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/argcheck.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/channel.cc:7: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/collectives.cc:7: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/argcheck.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/bootstrap.cc:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/bootstrap.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/collectives.cc:7: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/argcheck.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/bootstrap.cc:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/bootstrap.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/bootstrap.cc:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/bootstrap.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/bootstrap.cc:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/bootstrap.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/bootstrap.cc:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/bootstrap.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/collectives.cc:7: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/argcheck.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/collectives.cc:7: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/argcheck.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/collectives.cc:7: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/argcheck.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/debug.cc:7: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/collectives.cc:7: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/argcheck.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/collectives.cc:7: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/argcheck.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/bootstrap.cc:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/bootstrap.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/debug.cc:7: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/debug.cc:7: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/bootstrap.cc:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/bootstrap.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/bootstrap.cc:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/bootstrap.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/debug.cc:7: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/debug.cc:7: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/debug.cc:7: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/debug.cc:7: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for host. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/debug.cc:7: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/bootstrap.cc:8: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/collectives.cc:22:38: warning: unused variable 'AllGatherSchema' [-Wunused-variable] 22 | constexpr nvtxPayloadSchemaEntry_t AllGatherSchema[] = { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/collectives.cc:25:10: warning: unused variable 'msgsize' [-Wunused-variable] 25 | size_t msgsize = sendcount * ncclTypeSize(datatype); | ^~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/collectives.cc:22:38: warning: unused variable 'AllGatherSchema' [-Wunused-variable] 22 | constexpr nvtxPayloadSchemaEntry_t AllGatherSchema[] = { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/collectives.cc:25:10: warning: unused variable 'msgsize' [-Wunused-variable] 25 | size_t msgsize = sendcount * ncclTypeSize(datatype); | ^~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/collectives.cc:52:45: warning: unused variable 'AllReduceSchema' [-Wunused-variable] 52 | static constexpr nvtxPayloadSchemaEntry_t AllReduceSchema[] = { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/collectives.cc:57:23: warning: unused variable 'payload' [-Wunused-variable] 57 | NvtxParamsAllReduce payload{count * ncclTypeSize(datatype), op}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/collectives.cc:52:45: warning: unused variable 'AllReduceSchema' [-Wunused-variable] 52 | static constexpr nvtxPayloadSchemaEntry_t AllReduceSchema[] = { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/collectives.cc:57:23: warning: unused variable 'payload' [-Wunused-variable] 57 | NvtxParamsAllReduce payload{count * ncclTypeSize(datatype), op}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/collectives.cc:80:38: warning: unused variable 'AllToAllSchema' [-Wunused-variable] 80 | constexpr nvtxPayloadSchemaEntry_t AllToAllSchema[] = { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/collectives.cc:83:10: warning: unused variable 'msgsize' [-Wunused-variable] 83 | size_t msgsize = count * ncclTypeSize(datatype); | ^~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/collectives.cc:80:38: warning: unused variable 'AllToAllSchema' [-Wunused-variable] 80 | constexpr nvtxPayloadSchemaEntry_t AllToAllSchema[] = { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/collectives.cc:83:10: warning: unused variable 'msgsize' [-Wunused-variable] 83 | size_t msgsize = count * ncclTypeSize(datatype); | ^~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/bootstrap.cc:8: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/collectives.cc:128:38: warning: unused variable 'AllToAllvSchema' [-Wunused-variable] 128 | constexpr nvtxPayloadSchemaEntry_t AllToAllvSchema[] = { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/collectives.cc:132:23: warning: unused variable 'payload' [-Wunused-variable] 132 | NvtxParamsAllToAllv payload{sendcounts[comm->rank] * ncclTypeSize(datatype), recvcounts[comm->rank] * ncclTypeSize(datatype)}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/collectives.cc:173:38: warning: unused variable 'BroadcastSchema' [-Wunused-variable] 173 | constexpr nvtxPayloadSchemaEntry_t BroadcastSchema[] = { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/collectives.cc:177:23: warning: unused variable 'payload' [-Wunused-variable] 177 | NvtxParamsBroadcast payload{count * ncclTypeSize(datatype), root}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/collectives.ccIn file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/bootstrap.cc:8: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ :128:38: warning: unused variable 'AllToAllvSchema' [-Wunused-variable] 128 | constexpr nvtxPayloadSchemaEntry_t AllToAllvSchema[] = { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/collectives.cc:132:23: warning: unused variable 'payload' [-Wunused-variable] 132 | NvtxParamsAllToAllv payload{sendcounts[comm->rank] * ncclTypeSize(datatype), recvcounts[comm->rank] * ncclTypeSize(datatype)}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/collectives.cc:210:40: warning: unused variable 'GatherSchema' [-Wunused-variable] 210 | constexpr nvtxPayloadSchemaEntry_t GatherSchema[] = { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/collectives.cc:214:22: warning: unused variable 'payload' [-Wunused-variable] 214 | NvtxParamsGather payload{sendcount * ncclTypeSize(datatype), root}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/collectives.cc:173:38: warning: unused variable 'BroadcastSchema' [-Wunused-variable] 173 | constexpr nvtxPayloadSchemaEntry_t BroadcastSchema[] = { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/collectives.cc:177:23: warning: unused variable 'payload' [-Wunused-variable] 177 | NvtxParamsBroadcast payload{count * ncclTypeSize(datatype), root}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/collectives.cc:249:38: warning: unused variable 'ReduceSchema' [-Wunused-variable] 249 | constexpr nvtxPayloadSchemaEntry_t ReduceSchema[] = { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/collectives.cc:255:20: warning: unused variable 'payload' [-Wunused-variable] 255 | NvtxParamsReduce payload{count * ncclTypeSize(datatype), root, op}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/collectives.cc:281:38: warning: unused variable 'ReduceScatterSchema' [-Wunused-variable] 281 | constexpr nvtxPayloadSchemaEntry_t ReduceScatterSchema[] = { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/collectives.cc:286:27: warning: unused variable 'payload' [-Wunused-variable] 286 | NvtxParamsReduceScatter payload{recvcount * ncclTypeSize(datatype), op}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/collectives.cc/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/collectives.cc/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/collectives.cc:22:38: warning: unused variable 'AllGatherSchema' [-Wunused-variable] 22 | constexpr nvtxPayloadSchemaEntry_t AllGatherSchema[] = { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/collectives.cc:25:10: warning: unused variable 'msgsize' [-Wunused-variable] 25 | size_t msgsize = sendcount * ncclTypeSize(datatype); | ^~~~~~~ :210:40: warning: unused variable 'GatherSchema' [-Wunused-variable] 210 | constexpr nvtxPayloadSchemaEntry_t GatherSchema[] = { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/collectives.cc:214:22: warning: unused variable 'payload' [-Wunused-variable] 214 | NvtxParamsGather payload{sendcount * ncclTypeSize(datatype), root}; | ^~~~~~~ :22:38: warning: unused variable 'AllGatherSchema' [-Wunused-variable] 22 | constexpr nvtxPayloadSchemaEntry_t AllGatherSchema[] = { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/collectives.cc:25:/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/collectives.cc:312:40: warning: unused variable 'ScatterSchema' [-Wunused-variable] 312 | constexpr nvtxPayloadSchemaEntry_t ScatterSchema[] = { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/collectives.cc:316:23: warning: unused variable 'payload' [-Wunused-variable] 316 | NvtxParamsScatter payload{recvcount * ncclTypeSize(datatype), root}; | ^~~~~~~ 10: warning: unused variable 'msgsize' [-Wunused-variable] 25 | size_t msgsize = sendcount * ncclTypeSize(datatype); | ^~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/collectives.cc:52:45: warning: unused variable 'AllReduceSchema' [-Wunused-variable] 52 | static constexpr nvtxPayloadSchemaEntry_t AllReduceSchema[] = { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/collectives.cc:57:23: warning: unused variable 'payload' [-Wunused-variable] 57 | NvtxParamsAllReduce payload{count * ncclTypeSize(datatype), op}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/collectives.cc/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/collectives.cc:52:45: warning: unused variable 'AllReduceSchema' [-Wunused-variable] 52 | static constexpr nvtxPayloadSchemaEntry_t AllReduceSchema[] = { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/collectives.cc:57:23: warning: unused variable 'payload' [-Wunused-variable] 57 | NvtxParamsAllReduce payload{count * ncclTypeSize(datatype), op}; | ^~~~~~~ :249:38: warning: unused variable 'ReduceSchema' [-Wunused-variable] 249 | constexpr nvtxPayloadSchemaEntry_t ReduceSchema[] = { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/collectives.cc:255:20: warning: unused variable 'payload' [-Wunused-variable] 255 | NvtxParamsReduce payload{count * ncclTypeSize(datatype), root, op}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/collectives.cc/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/collectives.cc:356:22: warning: unused variable 'payload' [-Wunused-variable] 356 | NvtxParamsSendRecv payload{count * ncclTypeSize(datatype), peer}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/collectives.cc:381:22: warning: unused variable 'payload' [-Wunused-variable] 381 | NvtxParamsSendRecv payload{count * ncclTypeSize(datatype), peer}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/collectives.cc:281:38: warning: unused variable 'ReduceScatterSchema' [-Wunused-variable] 281 | constexpr nvtxPayloadSchemaEntry_t ReduceScatterSchema[] = { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/collectives.cc:286:27: warning: unused variable 'payload' [-Wunused-variable] 286 | NvtxParamsReduceScatter payload{recvcount * ncclTypeSize(datatype), op}; | ^~~~~~~ :80:38: warning: unused variable 'AllToAllSchema' [-Wunused-variable] 80 | constexpr nvtxPayloadSchemaEntry_t AllToAllSchema[] = { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/collectives.cc:83:10: warning: unused variable 'msgsize' [-Wunused-variable] 83 | size_t msgsize = count * ncclTypeSize(datatype); | ^~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/collectives.cc:80:38: warning: unused variable 'AllToAllSchema' [-Wunused-variable] 80 | constexpr nvtxPayloadSchemaEntry_t AllToAllSchema[] = { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/collectives.cc:83:10: warning: unused variable 'msgsize' [-Wunused-variable] 83 | size_t msgsize = count * ncclTypeSize(datatype); | ^~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/collectives.cc/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/collectives.cc:128:38: warning: unused variable 'AllToAllvSchema' [-Wunused-variable] 128 | constexpr nvtxPayloadSchemaEntry_t AllToAllvSchema[] = { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/collectives.cc:132:23: warning: unused variable 'payload' [-Wunused-variable] 132 | NvtxParamsAllTo/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/collectives.cc:22:38: warning: unused variable 'AllGatherSchema' [-Wunused-variable] 22 | constexpr nvtxPayloadSchemaEntry_t AllGatherSchema[] = { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/collectives.cc:25:10: warning: unused variable 'msgsize' [-Wunused-variable] 25 | size_t msgsize = sendcount * ncclTypeSize(datatype); | ^~~~~~~ A:312:40: warning: unused variable 'ScatterSchema' [-Wunused-variable] llv p312ayload{sendcounts[comm->rank] * ncclTypeSize( | datatype), recvcounts[comm->ra nk] * ncclTypeSize(dataty constexpr nvtxPaylope)}; | ^~~~~~~ adSchemaEntry_t ScatterSchema[] = { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/collectives.cc:316:23: warning: unused variable 'payload' [-Wunused-variable] 316 | NvtxParamsScatter payload{recvcount * ncclTypeSize(datatype), root}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/collectives.cc:52:45: warning: unused variable 'AllReduceSchema' [-Wunused-variable] 52 | static constexpr nvtxPayloadSchemaEntry_t AllReduceSchema[] = { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/collectives.cc:57:23: warning: unused variable 'payload' [-Wunused-variable] 57 | NvtxParamsAllReduce payload{count * ncclTypeSize(datatype), op}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/collectives.cc:128:38: warning: unused variable 'AllToAllvSchema' [-Wunused-variable] 128 | constexpr nvtxPayloadSchemaEntry_t AllToAllvSchema[] = { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/collectives.cc:132:23: warning: unused variable 'payload' [-Wunused-variable] 132 | NvtxParamsAllToAllv payload{sendcounts[comm->rank] * ncclTypeSize(datatype), recvcounts[comm->rank] * ncclTypeSize(datatype)}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/collectives.cc:173:38: warning: unused variable 'BroadcastSchema' [-Wunused-variable] 173 | constexpr nvtxPayloadSchemaEntry_t BroadcastSchema[] = { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/collectives.cc:177:23: warning: unused variable 'payload' [-Wunused-variable] 177 | NvtxParamsBroadcast payload{count * ncclTypeSize(datatype), root}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/collectives.ccIn file included from :173:38: warning: unused variable 'BroadcastSchema' [-Wunused-variable] 173 | constexpr nvtxPayloadSchemaEntry_t BroadcastSchema[] = { | /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/bootstrap.cc:8: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/collectives.cc:177:23: warning: unused variable 'payload' [-Wunused-variable] 177 | NvtxParamsBroadcast payload{count * ncclTypeSize(datatype), root}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/collectives.cc/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/collectives.cc:80:38: warning: unused variable 'AllToAllSchema' [-Wunused-variable] 80 | constexpr nvtxPayloadSchemaEntry_t AllToAllSchema[] = { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/collectives.cc:83:10: warning: unused variable 'msgsize' [-Wunused-variable] 83 | size_t msgsize = count * ncclTypeSize(datatype); | ^~~~~~~ :356:22: warning: unused variable 'payload' [-Wunused-variable] 356 | NvtxParamsSendRecv payload{count * ncclTypeSize(datatype), peer}; | ^~~~~~~ 1/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/collectives.cc:210:40: warning: unused variable 'GatherSchema' [-Wunused-variable] 210 | constexpr nvtxPayloadSchemaEntry_t GatherSchema[] = { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/collectives.cc:214:22: warning: unused variable 'payload' [-Wunused-variable] 214 | NvtxParamsGather payload{sendcount * ncclTypeSize(datatype), root}; | ^~~~~~~ warning generated when compiling for gfx1102. /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/collectives.cc:210:40: warning: unused variable 'GatherSchema' [-Wunused-variable] 210 | constexpr nvtxPayloadSchemaEntry_t GatherSchema[] = { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/collectives.cc:214:22: warning: unused variable 'payload' [-Wunused-variable] 214 | NvtxParamsGather payload{sendcount * ncclTypeSize(datatype), root}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/collectives.cc:22:38: warning: unused variable 'AllGatherSchema' [-Wunused-variable] 22 | constexpr nvtxPayloadSchemaEntry_t AllGatherSchema[] = { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/collectives.cc:25:10: warning: unused variable 'msgsize' [-Wunused-variable] 25 | size_t msgsize = sendcount * ncclTypeSize(datatype); | ^~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/collectives.cc/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/collectives.cc:52:45: warning: unused variable 'AllReduceSchema' [-Wunused-variable] 52 | static constexpr nvtxPayloadSchemaEntry_t AllReduceSchema[] = { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/collectives.cc:57:23: warning: unused variable 'payload' [-Wunused-variable] 57 | NvtxParamsAllReduce payload{count * ncclTypeSize(datatype), op}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/collectives.cc:249:38: warning: unused variable 'ReduceSchema' [-Wunused-variable] 249 | constexpr nvtxPayloadSchemaEntry_t ReduceSchema[] = { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/collectives.cc:255:20: warning: unused variable 'payload' [-Wunused-variable] 255 | NvtxParamsReduce payload{count * ncclTypeSize(datatype), root, op}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/collectives.cc:249:38: warning: unused variable 'ReduceSchema' [-Wunused-variable] 249 | constexpr nvtxPayloadSchemaEntry_t ReduceSchema[] = { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/collectives.cc:255:20: warning: unused variable 'payload' [-Wunused-variable] 255 | NvtxParamsReduce payload{count * ncclTypeSize(datatype), root, op}; | ^~~~~~~ :381:22: warning: unused variable 'payload' [-Wunused-variable] 381 | Nvtx/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/collectives.cc:128:38: warning: unused variable 'AllToAllvSchema' [-Wunused-variable] 128 | constexpr nvtxPayloadSchemaEntry_t AllToAllvSchema[] = { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/collectives.cc:132:23: warning: unused variable 'payload' [-Wunused-variable] 132 | NvtxParamsAllToAllv payload{sendcounts[comm->rank] * ncclTypeSize(datatype), recvcounts[comm->rank] * ncclTypeSize(datatype)}; | ^~~~~~~ ParamsSendRecv payload{count * ncclTypeSize(datatype), peer}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/collectives.cc:173:38: warning: unused variable 'BroadcastSchema' [-Wunused-variable] 173 | constexpr nvtxPayloadSchemaEntry_t BroadcastSchema[] = { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/collectives.cc:177:23/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/collectives.cc:281:38: warning: unused variable 'ReduceScatterSchema' [-Wunused-variable] 281 | constexpr nvtxPayloadSchemaEntry_t Red: warning: unused variable 'payload' [-Wunused-variable] 177 | NvtxParamsBroadcast payload{count * ncclTypeSize(datatype), root}; | ^~~~~~~ uceSc/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/collectives.cca:80:38: warning: unused variable 'AllToAllSchema' [-Wunused-variable] 80 | constexpr nvtxPayloadSchemaEntry_t AllToAllSchema[] = { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/collectives.cc:83:10: warning: unused variable 'msgsize' [-Wunused-variable] 83 | size_t msgsize = count * ncclTypeSize(datatype); | ^~~~~~~ tterSchema[] = { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/collectives.cc:286:27: warning: unused variable 'payload' [-Wunused-variable] 286 | NvtxParamsReduceScatter payload{recvcount * ncclTypeSize(datatype), op}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/collectives.cc:281:38: warning: unused variable 'ReduceScatterSchema' [-Wunused-variable] 281 | constexpr nvtxPayloadSchemaEntry_t ReduceScatterSchema[] = { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/collectives.cc:286:27: warning: unused variable 'payload' [-Wunused-variable] 286 | NvtxParamsReduceScatter payload{recvcount * ncclTypeSize(datatype), op}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/collectives.cc:312:40: warning: unused variable 'ScatterSchema' [-Wunused-variable] 312 | constexpr nvtxPayloadSchemaEntry_t ScatterSchema[] = { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/collectives.cc:316:23: warning: unused variable 'payload' [-Wunused-variable] 316 | NvtxParamsScatter payload{recvcount * ncclTypeSize(datatype), root}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/collectives.cc:210:40: warning: unused variable 'GatherSchema' [-Wunused-variable] 210 | constexpr nvtxPayloadSchemaEntry_t GatherSchema[] = { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/collectives.cc:214:22: warning: unused variable 'payload' [-Wunused-variable] 214 | NvtxParamsGather payload{sendcount * ncclTypeSize(datatype), root}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/collectives.cc:356:22: warning: unused variable 'payload' [-Wunused-variable] 356 | NvtxParamsSendRecv payload{count * ncclTypeSize(datatype), peer}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/collectives.cc/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/collectives.cc:128:38: warning: unused variable 'AllToAllvSchema' [-Wunused-variable] 128 | constexpr nvtxPayloadSchemaEntry_t AllToAllvSchema[] = { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/collectives.cc:132:23: warning: unused variable 'payload' [-Wunused-variable] 132 | NvtxParamsAllToAllv payload{sendcounts[comm->rank] * ncclTypeSize(datatype), recvcounts[comm->rank] * ncclTypeSize(datatype)}; | ^~~~~~~ :381:22: warning: unused variable 'payload' [-Wunused-variable] 381 | NvtxPa/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/collectives.cc:173:38: warning: unused variable 'BroadcastSchema' [-Wunused-variable] 173 | constexpr nvtxPayloadSchemaEntry_t BroadcastSchema[] = { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/collectives.cc:177:23: warning: unused variable 'payload' [-Wunused-variable] 177 | NvtxParamsBroadcast payload{count * ncclTypeSize(datatype), root}; | ^~~~~~~ ramsSendRecv payload{count * ncclTypeSize(datatype), peer}; | ^~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/bootstrap.cc:8: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/collectives.cc:249:38: warning: unused variable 'ReduceSchema' [-Wunused-variable] 249 | constexpr nvtxPayloadSchemaEntry_t ReduceSchema[] = { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/collectives.cc:255:20: warning: unused variable 'payload' [-Wunused-variable] 255 | NvtxParamsReduce payload{count * ncclTypeSize(datatype), root, op}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/collectives.cc:312:40: warning: unused variable 'ScatterSchema' [-Wunused-variable] 312 | constexpr nvtxPayloadSchemaEntry_t ScatterSchema[] = { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/collectives.cc:316:23: warning: unused variable 'payload' [-Wunused-variable] In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/bootstrap.cc:8: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/collectives.cc:281:38: warning: unused variable 'ReduceScatterSchema' [-Wunused-variable] 281 | constexpr nvtxPayloadSchemaEntry_t ReduceScatterSchema[] = { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/collectives.cc:286:27: warning: unused variable 'payload' [-Wunused-variable] 286 | NvtxParamsReduceScatter payload{recvcount * ncclTypeSize(datatype), op}; | ^~~~~~~ 316 | NvtxParamsScatter payload{recvcount * ncclTypeSize(datatype), root}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/collectives.cc:210:40: warning: unused variable 'GatherSchema' [-Wunused-variable] 210 | constexpr nvtxPayloadSchemaEntry_t GatherSchema[] = { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/collectives.cc:214:22: warning: unused variable 'payload' [-Wunused-variable] 214 | NvtxParamsGather payload{sendcount * ncclTypeSize(datatype), root}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/collectives.cc/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/collectives.cc:356:22: warning: unused variable 'payload' [-Wunused-variable] 356 | NvtxParamsSendRecv payload{count * ncclTypeSize(datatype), peer}; | ^~~~~~~ :312:40: warning: unused variable 'ScatterSchema' [-Wunused-variable] 312 | constexpr nvtxPayloadSchemaEntry_t ScatterSchema[] = { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/collectives.cc:316:23: warning: unused variable 'payload' [-Wunused-variable] 316 | NvtxParamsScatter payload{recvcount * ncclTypeSize(datatype), root}; | ^~~~~~~ 1 warning generated when compiling for gfx90a. 1 warning generated when compiling for gfx1100. /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/collectives.cc:249:38: warning: unused variable 'ReduceSchema' [-Wunused-variable] 249 | constexpr nvtxPayloadSchemaEntry_t ReduceSchema[] = { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/collectives.cc:255:20: warning: unused variable 'payload' [-Wunused-variable] 255 | NvtxParamsReduce payload{count * ncclTypeSize(datatype), root, op}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/collectives.cc:281:38: warning: unused variable 'ReduceScatterSchema' [-Wunused-variable] 281 | constexpr nvtxPayloadSchemaEntry_t ReduceScatterSchema[] = { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/collectives.cc:286:27: warning: unused variable 'payload' [-Wunused-variable] 286 | NvtxParamsReduceScatter payload{recvcount * ncclTypeSize(datatype), op}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/collectives.cc:356:22: warning: unused variable 'payload' [-Wunused-variable] 356 | NvtxParamsSendRecv payload{count * ncclTypeSize(datatype), peer}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/collectives.cc:381:22: warning: unused variable 'payload' [-Wunused-variable] 381 | NvtxParamsSendRecv payload{count * ncclTypeSize(datatype), peer}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/collectives.cc:381:22: warning: unused variable 'payload' [-Wunused-variable] 381 | NvtxParamsSendRecv payload{count * ncclTypeSize(datatype), peer}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/collectives.cc:312:40: warning: unused variable 'ScatterSchema' [-Wunused-variable] 312 | constexpr nvtxPayloadSchemaEntry_t ScatterSchema[] = { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/collectives.cc:316:23: warning: unused variable 'payload' [-Wunused-variable] 316 | NvtxParamsScatter payload{recvcount * ncclTypeSize(datatype), root}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/collectives.cc:22:38: warning: unused variable 'AllGatherSchema' [-Wunused-variable] 22 | constexpr nvtxPayloadSchemaEntry_t AllGatherSchema[] = { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/collectives.cc:25:10: warning: unused variable 'msgsize' [-Wunused-variable] 25 | size_t msgsize = sendcount * ncclTypeSize(datatype); | ^~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/collectives.cc:52:45: warning: unused variable 'AllReduceSchema' [-Wunused-variable] 52 | static constexpr nvtxPayloadSchemaEntry_t AllReduceSchema[] = { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/collectives.cc:57:23: warning: unused variable 'payload' [-Wunused-variable] 57 | NvtxParamsAllReduce payload{count * ncclTypeSize(datatype), op}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/collectives.cc:356:22: warning: unused variable 'payload' [-Wunused-variable] 356 | NvtxParamsSendRecv payload{count * ncclTypeSize(datatype), peer}; | ^~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/bootstrap.cc:8: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/collectives.cc:80:38: warning: unused variable 'AllToAllSchema' [-Wunused-variable] 80 | constexpr nvtxPayloadSchemaEntry_t AllToAllSchema[] = { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/collectives.cc:83:10: warning: unused variable 'msgsize' [-Wunused-variable] 83 | size_t msgsize = count * ncclTypeSize(datatype); | ^~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/collectives.cc:381:22: warning: unused variable 'payload' [-Wunused-variable] 381 | NvtxParamsSendRecv payload{count * ncclTypeSize(datatype), peer}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/collectives.cc:22:38: warning: unused variable 'AllGatherSchema' [-Wunused-variable] 22 | constexpr nvtxPayloadSchemaEntry_t AllGatherSchema[] = { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/collectives.cc:25:10: warning: unused variable 'msgsize' [-Wunused-variable] 25 | size_t msgsize = sendcount * ncclTypeSize(datatype); | ^~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/collectives.cc:128:38: warning: unused variable 'AllToAllvSchema' [-Wunused-variable] 128 | constexpr nvtxPayloadSchemaEntry_t AllToAllvSchema[] = { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/collectives.cc:132:23: warning: unused variable 'payload' [-Wunused-variable] 132 | NvtxParamsAllToAllv payload{sendcounts[comm->rank] * ncclTypeSize(datatype), recvcounts[comm->rank] * ncclTypeSize(datatype)}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/collectives.cc:173:38: warning: unused variable 'BroadcastSchema' [-Wunused-variable] 173 | constexpr nvtxPayloadSchemaEntry_t BroadcastSchema[] = { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/collectives.cc:177:23: warning: unused variable 'payload' [-Wunused-variable] 177 | NvtxParamsBroadcast payload{count * ncclTypeSize(datatype), root}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/collectives.cc:52:45: warning: unused variable 'AllReduceSchema' [-Wunused-variable] 52 | static constexpr nvtxPayloadSchemaEntry_t AllReduceSchema[] = { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/collectives.cc:57:23: warning: unused variable 'payload' [-Wunused-variable] 57 | NvtxParamsAllReduce payload{count * ncclTypeSize(datatype), op}; | ^~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/bootstrap.cc:8: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/collectives.cc:7: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/argcheck.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long l/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/collectives.ccog2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/collectives.cc:10: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:211:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 211 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:222:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 222 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:233:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 233 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:245:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 245 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int*:80:38: warning: unused variable 'AllToAllSchema' [-Wunused-variable] 80 | constexpr nvtxPayloadSchemaEntry_t AllToAllSchema[] = { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/collectives.cc:83:10: warning: unused variable 'msgsize' [-Wunused-variable] 83 | size_t msgsize = count * ncclTypeSize(datatype); | ^~~~~~~ netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:258:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 258 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:268:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 268 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:279:13: warning: unused function 'isPow2' [-Wunused-function] 279 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:282:12: warning: unused function 'mirrorBits' [-Wunused-function] 282 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/collectives.cc:345:42: warning: unused variable 'SendRecvSchema' [-Wunused-const-variable] 345 | constexpr const nvtxPayloadSchemaEntry_t SendRecvSchema[] = { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/collectives.cc:128:38: warning: unused variable 'AllToAllvSchema' [-Wunused-variable] 128 | constexpr nvtxPayloadSchemaEntry_t AllToAllvSchema[] = { | /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/collectives.cc:210:40: warning: unused variable 'GatherSchema' [-Wunused-variable] 210 | constexpr nvtxPayloadSchemaEntry_t GatherSchema[] = { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/collectives.cc:214:22: warning: unused variable 'payload' [-Wunused-variable] 214 | NvtxParamsGather payload{sendcount * ncclTypeSize(datatype), root}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/collectives.cc:249:38: warning: unused variable 'ReduceSchema' [-Wunused-variable] 249 | constexpr nvtxPayloadSchemaEntry_t ReduceSchema[] = { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/collectives.cc:255:20: warning: unused variable 'payload' [-Wunused-variable] 255 | NvtxParamsReduce payload{count * ncclTypeSize(datatype), root, op}; | ^~~~~~~ ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/collectives.cc:132:23: warning: unused variable 'payload' [-Wunused-variable] 132 | NvtxParamsAllToAllv payload{sendcounts[comm->rank] * ncclTypeSize(datatype), recvcounts[comm->rank] * ncclTypeSize(datatype)}; | ^~~~~~~ In file included from 1 warning generated when compiling for gfx1201. /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/collectives.cc:7: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/argcheck.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/collectives.cc:281:38: warning: unused variable 'ReduceScatterSchema' [-Wunused-variable] 281 | constexpr nvtxPayloadSchemaEntry_t ReduceScatterSchema[] = { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/collectives.cc:286:27: warning: unused variable 'payload' [-Wunused-variable] 286 | NvtxParamsReduceScatter payload{recvcount * ncclTypeSize(datatype), op}; | ^~~~~~~ 13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/collectives.cc:10: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:211:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 211 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/collectives.cc64_t id, int:*173:38 : warning: unused variable 'BroadcastSchema' [-Wunused-variable]i ndex) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:222:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 222 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:233:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 233 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:245:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 245 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:258:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 258 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:268:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 268 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:279:13: warning: unused function 'isPow2' [-Wunused-function] 279 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:282:12: warning: unused function 'mirrorBits' [-Wunused-function] 282 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/collectives.cc:345:42: warning: unused variable 'SendRecvSchema' [-Wunused-const-variable] 345 | constexpr const nvtxPayloadSchemaEntry_t SendRecvSchema[] = { | ^~~~~~~~~~~~~~ 173 | constexpr nvtxPayloadSchemaEntry_t BroadcastSchema[] = { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/collectives.cc:177:23: warning: unused variable 'payload' [-Wunused-variable] 177 | NvtxParamsBroadcast payload{count * ncclTypeSize(datatype), root}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/collectives.cc:312:40: warning: unused variable 'ScatterSchema' [-Wunused-variable] 312 | constexpr nvtxPayloadSchemaEntry_t ScatterSchema[] = { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/collectives.cc:316:23: warning: unused variable 'payload' [-Wunused-variable] 316 | NvtxParamsScatter payload{recvcount * ncclTypeSize(datatype), root}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/collectives.cc:210:40: warning: unused variable 'GatherSchema' [-Wunused-variable] 210 | constexpr nvtxPayloadSchemaEntry_t GatherSchema[] = { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/collectives.cc:214:22: warning: unused variable 'payload' [-Wunused-variable] 214 | NvtxParamsGather payload{sendcount * ncclTypeSize(datatype), root}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/collectives.cc:249:38: warning: unused variable 'ReduceSchema' [-Wunused-variable] 249 | constexpr nvtxPayloadSchemaEntry_t ReduceSchema[] = { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/collectives.cc:255:20: warning: unused variable 'payload' [-Wunused-variable] 255 | NvtxParamsReduce payload{count * ncclTypeSize(datatype), root, op}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/collectives.cc:356:22: warning: unused variable 'payload' [-Wunused-variable] 356 | NvtxParamsSendRecv payload{count * ncclTypeSize(datatype), peer}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/collectives.cc:281:38: warning: unused variable 'ReduceScatterSchema' [-Wunused-variable] 281 | constexpr nvtxPayloadSchemaEntry_t ReduceScatterSchema[] = { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/collectives.cc:286:27: warning: unused variable 'payload' [-Wunused-variable] 286 | NvtxParamsReduceScatter payload{recvcount * ncclTypeSize(datatype), op}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/collectives.cc:312:40: warning: unused variable 'ScatterSchema' [-Wunused-variable] 312 | constexpr nvtxPayloadSchemaEntry_t ScatterSchema[] = { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/collectives.cc:316:23: warning: unused variable 'payload' [-Wunused-variable] 316 | NvtxParamsScatter payload{recvcount * ncclTypeSize(datatype), root}; | ^~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/collectives.cc:7: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/argcheck.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/collectives.cc:10: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:211:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 211 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:222:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 222 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:233:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 233 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:245:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 245 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:258:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 258 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:268:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 268 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:279:13: warning: unused function 'isPow2' [-Wunused-function] 279 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:282:12: warning: unused function 'mirrorBits' [-Wunused-function] 282 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/collectives.cc:345:42: warning: unused variable 'SendRecvSchema' [-Wunused-const-variable] 345 | constexpr const nvtxPayloadSchemaEntry_t SendRecvSchema[] = { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/collectives.cc:381:22: warning: unused variable 'payload' [-Wunused-variable] 381 | NvtxParamsSendRecv payload{count * ncclTypeSize(datatype), peer}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/collectives.cc:356:22: warning: unused variable 'payload' [-Wunused-variable] 356 | NvtxPar1 warning generated when compiling for gfx1200. amsSendRecv payload{count * ncclTypeSize(datatype), peer}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/collectives.cc:381:22: warning: unused variable 'payload' [-Wunused-variable] 381 | NvtxParamsSendRecv payload{count * ncclTypeSize(datatype), peer}; | ^~~~~~~ 1 warning generated when compiling for gfx90a. 1In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/collectives.cc:7: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/argcheck.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/collectives.cc:10: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:211:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 211 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:222:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 222 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:233:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 233 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:245:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 245 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:258:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 258 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:268:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 268 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:279:13: warning: unused function 'isPow2' [-Wunused-function] 279 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:282:12: warning: unused function 'mirrorBits' [-Wunused-function] 282 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/collectives.cc:345:42: warning: unused variable 'SendRecvSchema' [-Wunused-const-variable] 345 | constexpr const nvtxPayloadSchemaEntry_t SendRecvSchema[] = { | ^~~~~~~~~~~~~~ In file included from warning generated when compiling for gfx1101. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/collectives.cc:7: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/argcheck.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/collectives.cc:10: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:211:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 211 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:222:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 222 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:233:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 233 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:245:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 245 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:258:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 258 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:268:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 268 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:279:13: warning: unused function 'isPow2' [-Wunused-function] 279 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:282:12: warning: unused function 'mirrorBits' [-Wunused-function] 282 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/collectives.cc:345:42: warning: unused variable 'SendRecvSchema' [-Wunused-const-variable] 345 | constexpr const nvtxPayloadSchemaEntry_t SendRecvSchema[] = { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/collectives.cc:7: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/argcheck.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/collectives.cc:10: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:211:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 211 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:222:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 222 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:233:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 233 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:245:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 245 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:258:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 258 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:268:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 268 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:279:13: warning: unused function 'isPow2' [-Wunused-function] 279 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:282:12: warning: unused function 'mirrorBits' [-Wunused-function] 282 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/collectives.cc:345:42: warning: unused variable 'SendRecvSchema' [-Wunused-const-variable] 345 | constexpr const nvtxPayloadSchemaEntry_t SendRecvSchema[] = { | ^~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/collectives.cc:7: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/argcheck.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/collectives.cc:10: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:211:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 211 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:222:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 222 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:233:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 233 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:245:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 245 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:258:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 258 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:268:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 268 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:279:13: warning: unused function 'isPow2' [-Wunused-function] 279 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:282:12: warning: unused function 'mirrorBits' [-Wunused-function] 282 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/collectives.cc:345:42: warning: unused variable 'SendRecvSchema' [-Wunused-const-variable] 345 | constexpr const nvtxPayloadSchemaEntry_t SendRecvSchema[] = { | ^~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/collectives.cc:7: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/argcheck.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/collectives.cc:10: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:211:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 211 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:222:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 222 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:233:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 233 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:245:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 245 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:258:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 258 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:268:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 268 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:279:13: warning: unused function 'isPow2' [-Wunused-function] 279 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:282:12: warning: unused function 'mirrorBits' [-Wunused-function] 282 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/collectives.cc:345:42: warning: unused variable 'SendRecvSchema' [-Wunused-const-variable] 345 | constexpr const nvtxPayloadSchemaEntry_t SendRecvSchema[] = { | ^~~~~~~~~~~~~~ [ 43%] Building CXX object CMakeFiles/rccl.dir/hipify/src/enqueue.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60300 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/src/enqueue.cc.o -MF CMakeFiles/rccl.dir/hipify/src/enqueue.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/enqueue.cc.o -c /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/enqueue.cc 2 warnings generated when compiling for gfx1100. 2 warnings generated when compiling for gfx90a. 2 warnings generated when compiling for gfx1201. 2 warnings generated when compiling for gfx90a. 2 warnings generated when compiling for gfx1200. 31 warnings generated when compiling for gfx1100. 2 warnings generated when compiling for gfx1102. 31 warnings generated when compiling for host. 2 warnings generated when compiling for gfx1101. 31 warnings generated when compiling for gfx1201. 31 warnings generated when compiling for gfx90a. 3131 warnings generated when compiling for gfx90a. warnings generated when compiling for gfx1101. 31 warnings generated when compiling for gfx1200. 31 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/channel.cc:9: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/gdrwrap.h:204:19: warning: unused variable 'md' [-Wunused-variable] 204 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/channel.cc:9: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/gdrwrap.h:204:19: warning: unused variable 'md' [-Wunused-variable] 204 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ [ 44%] Building CXX object CMakeFiles/rccl.dir/hipify/src/group.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60300 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/src/group.cc.o -MF CMakeFiles/rccl.dir/hipify/src/group.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/group.cc.o -c /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/group.cc In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/channel.cc:9: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/gdrwrap.h:204:19: warning: unused variable 'md' [-Wunused-variable] 204 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/channel.cc:9: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/gdrwrap.h:204:19: warning: unused variable 'md' [-Wunused-variable] 204 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/channel.cc:9: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/gdrwrap.h:204:19: warning: unused variable 'md' [-Wunused-variable] 204 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/channel.cc:9: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/gdrwrap.h:204:19: warning: unused variable 'md' [-Wunused-variable] 204 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/channel.cc:9: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/gdrwrap.h:204:19: warning: unused variable 'md' [-Wunused-variable] 204 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/channel.cc:7: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/channel.cc:7: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/channel.cc:9: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/gdrwrap.h:209:21: warning: unused function 'ncclGdrCudaFree' [-Wunused-function] 209 | static ncclResult_t ncclGdrCudaFree(void* gdrHandle) { | ^~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/channel.cc:7: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/channel.cc:7: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/channel.h:41:In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/channel.cc:7: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/channel.cc:7: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/channel.cc:9: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/gdrwrap.h:209:21: warning: unused function 'ncclGdrCudaFree' [-Wunused-function] 209 | static ncclResult_t ncclGdrCudaFree(void* gdrHandle) { | ^~~~~~~~~~~~~~~ 21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/channel.cc:9: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/gdrwrap.h:209:21: warning: unused function 'ncclGdrCudaFree' [-Wunused-function] 209 | static ncclResult_t ncclGdrCudaFree(void* gdrHandle) { | ^~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/channel.cc:7: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/channel.cc:7: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/channel.cc:9: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/gdrwrap.h:209:21: warning: unused function 'ncclGdrCudaFree' [-Wunused-function] 209 | static ncclResult_t ncclGdrCudaFree(void* gdrHandle) { | ^~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/channel.cc:7: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/channel.cc:7: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/channel.cc:9: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/gdrwrap.h:209:21: warning: unused function 'ncclGdrCudaFree' [-Wunused-function] 209 | static ncclResult_t ncclGdrCudaFree(void* gdrHandle) { | ^~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/channel.cc:7: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/channel.cc:7: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/channel.cc:9: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/gdrwrap.h:209:21: warning: unused function 'ncclGdrCudaFree' [-Wunused-function] 209 | static ncclResult_t ncclGdrCudaFree(void* gdrHandle) { | ^~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/channel.cc:9: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/gdrwrap.h:204:19: warning: unused variable 'md' [-Wunused-variable] 204 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/channel.cc:7: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/channel.cc:7: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/channel.cc:9: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/gdrwrap.h:209:21: warning: unused function 'ncclGdrCudaFree' [-Wunused-function] 209 | static ncclResult_t ncclGdrCudaFree(void* gdrHandle) { | ^~~~~~~~~~~~~~~ 2 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/channel.cc:7: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/channel.cc:7: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/channel.cc:9: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/gdrwrap.h:209:21: warning: unused function 'ncclGdrCudaFree' [-Wunused-function] 209 | static ncclResult_t ncclGdrCudaFree(void* gdrHandle) { | ^~~~~~~~~~~~~~~ [ 44%] Building CXX object CMakeFiles/rccl.dir/hipify/src/init.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60300 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/src/init.cc.o -MF CMakeFiles/rccl.dir/hipify/src/init.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/init.cc.o -c /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/init.cc 9 warnings generated when compiling for gfx1200. 9 warnings generated when compiling for gfx1101. 9 warnings generated when compiling for gfx1100. 9 warnings generated when compiling for gfx90a. 9 warnings generated when compiling for gfx1102. 9 warnings generated when compiling for gfx90a. 9 warnings generated when compiling for gfx1201. 9 warnings generated when compiling for host. [ 44%] Building CXX object CMakeFiles/rccl.dir/hipify/src/init_nvtx.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60300 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/src/init_nvtx.cc.o -MF CMakeFiles/rccl.dir/hipify/src/init_nvtx.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/init_nvtx.cc.o -c /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/init_nvtx.cc In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/enqueue.cc:9: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/enqueue.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/enqueue.cc:9: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/enqueue.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/enqueue.cc:9: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/enqueue.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/enqueue.cc:9: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/enqueue.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/enqueue.cc:9: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/enqueue.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/enqueue.cc:9: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/enqueue.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/enqueue.cc:9: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/enqueue.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/enqueue.cc:9: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/enqueue.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/group.cc:9: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/group.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/group.cc:9: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/group.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/group.cc:9: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/group.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/group.cc:9: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/group.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/group.cc:9: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/group.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/group.cc:9: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/group.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/group.cc:9: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/group.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/group.cc:9: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/group.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/init.cc:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/init.cc:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/init.cc:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/init.cc:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/group.cc:9: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/group.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/group.cc:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/init.cc:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/init.cc:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/init.cc:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/init.cc:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/group.cc:9: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/group.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/group.cc:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/group.cc:9: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/group.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/group.cc:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/group.cc:9: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/group.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/group.cc:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/group.cc:9: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/group.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/group.cc:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/group.cc:9: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/group.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/group.cc:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/group.cc:9: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/group.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/group.cc:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/enqueue.cc:15: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/gdrwrap.h:204:19: warning: unused variable 'md' [-Wunused-variable] 204 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/group.cc:9: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/group.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/group.cc:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/enqueue.cc:100:5: warning: unused label 'ignore0' [-Wunused-label] 100 | ignore0:; | ^~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/enqueue.cc:15: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/gdrwrap.h:204:19: warning: unused variable 'md' [-Wunused-variable] 204 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/enqueue.cc:399:7: warning: variable 'rnChannels' set but not used [-Wunused-but-set-variable] 399 | int rnChannels = 0; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/enqueue.cc:15: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/gdrwrap.h:204:19: warning: unused variable 'md' [-Wunused-variable] 204 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ 3/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/enqueue.cc warnings generated when compiling for gfx1100. :506:7: warning: variable 'rnChannel' set but not used [-Wunused-but-set-variable] 506 | int rnChannel = 0; | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/enqueue.cc:610:34: warning: suggest braces around initialization of subobject [-Wmissing-braces] 610 | struct ncclWorkElemP2p elem = {0}; | ^ | {} In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/enqueue.cc:15: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/gdrwrap.h:204:19: warning: unused variable 'md' [-Wunused-variable] 204 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/enqueue.cc:15: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/gdrwrap.h:204:19: warning: unused variable 'md' [-Wunused-variable] 204 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/enqueue.cc:15: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/gdrwrap.h:204:19: warning: unused variable 'md' [-Wunused-variable] 204 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/enqueue.cc:15: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/gdrwrap.h:204:19: warning: unused variable 'md' [-Wunused-variable] 204 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ 3 warnings generated when compiling for gfx1201. /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/enqueue.cc:100:5: warning: unused label 'ignore0' [-Wunused-label] 100 | ignore0:; | ^~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/enqueue.cc:15: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/gdrwrap.h:204:19: warning: unused variable 'md' [-Wunused-variable] 204 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/enqueue.cc:100:5: warning: unused label 'ignore0' [-Wunused-label] 100 | ignore0:; | ^~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/enqueue.cc:100:5: warning: unused label 'ignore0' [-Wunused-label] 100 | ignore0:; | ^~~~~~~~ 3 warnings generated when compiling for gfx1200. /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/enqueue.cc:100:5: warning: unused label 'ignore0' [-Wunused-label] 100 | ignore0:; | ^~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/enqueue.cc:100:5: warning: unused label 'ignore0' [-Wunused-label] 100 | ignore0:; | ^~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/init_nvtx.cc:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/nvtx.h:10: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/nvtx3/nvtx3.hpp:1940:9: warning: suggest braces around initialization of subobject [-Wmissing-braces] 1940 | 0, // payload value (union) | ^ | {} /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/nvtx3/nvtx3.hpp:1942:9: warning: suggest braces around initialization of subobject [-Wmissing-braces] 1942 | 0 // message value (union) | ^ | {} /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/enqueue.cc:399:7: warning: variable 'rnChannels' set but not used [-Wunused-but-set-variable] 399 | int rnChannels = 0; | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/enqueue.cc:399:7: warning: variable 'rnChannels' set but not used [-Wunused-but-set-variable] 399 | int rnChannels = 0; | ^ 3 warnings generated when compiling for gfx90a. /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/enqueue.cc:100:5: warning: unused label 'ignore0' [-Wunused-label] 100 | ignore0:; | ^~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/enqueue.cc:506:7: warning: variable 'rnChannel' set but not used [-Wunused-but-set-variable] 506 | int rnChannel = 0; | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/enqueue.cc:399:7: warning: variable 'rnChannels' set but not used [-Wunused-but-set-variable] 399 | int rnChannels = 0; | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/enqueue.cc:506:7: warning: variable 'rnChannel' set but not used [-Wunused-but-set-variable] 506 | int rnChannel = 0; | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/enqueue.cc:610:34: warning: suggest braces around initialization of subobject [-Wmissing-braces] 610 | struct ncclWorkElemP2p elem = {0}; | ^ | {} In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/init_nvtx.cc:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/nvtx.h:10: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/nvtx3/nvtx3.hpp:1940:9: warning: suggest braces around initialization of subobject [-Wmissing-braces] 1940 | 0, /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/enqueue.cc:610:34: warning: suggest braces around initialization of subobject [-Wmissing-braces] 610 | struct ncclWorkElemP2p elem = {0}; | ^ | {} // payload value (union) | ^ | {} /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/nvtx3/nvtx3.hpp:1942:9: warning: suggest braces around initialization of subobject [-Wmissing-braces] 1942 | 0 // message value (union) | ^ | {} 3 warnings generated when compiling for gfx90a. /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/enqueue.cc:399:7: warning: variable 'rnChannels' set but not used [-Wunused-but-set-variable] 399 | int rnChannels = 0; | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/enqueue.cc:506:7: warning: variable 'rnChannel' set but not used [-Wunused-but-set-variable] 506 | int rnChannel = 0; | ^ 3 warnings generated when compiling for gfx1101. /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/enqueue.cc:610:34: warning: suggest braces around initialization of subobject [-Wmissing-braces] 610 | struct ncclWorkElemP2p elem = {0}; | ^ | {} /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/enqueue.cc:399:7: warning: variable 'rnChannels' set but not used [-Wunused-but-set-variable] 399 | int rnChannels = 0; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/init_nvtx.cc:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/nvtx.h:10: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/nvtx3/nvtx3.hpp:1940:9: warning: suggest braces around initialization of subobject [-Wmissing-braces] 1940 | 0, // payload value (union) | ^ | {} /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/nvtx3/nvtx3.hpp:1942:9: warning: suggest braces around initialization of subobject [-Wmissing-braces] 1942 | 0 // message value (union) | ^ | {} /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/enqueue.cc:506:7: warning: variable 'rnChannel' set but not used [-Wunused-but-set-variable] 506 | int rnChannel = 0; | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/enqueue.cc:506:7: warning: variable 'rnChannel' set but not used [-Wunused-but-set-variable] 506 | int rnChannel = 0; | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/enqueue.cc:610:34: warning: suggest braces around initialization of subobject [-Wmissing-braces] 610 | struct ncclWorkElemP2p elem = {0}; | ^ | {} In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/init_nvtx.cc:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/nvtx.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/roctx.h:18: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/enqueue.cc:610:34: warning: suggest braces around initialization of subobject [-Wmissing-braces] 610 | struct ncclWorkElemP2p elem = {0}; | ^ | {} /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/enqueue.cc:100:5: warning: unused label 'ignore0' [-Wunused-label] 100 | ignore0:; | ^~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/enqueue.cc:399:7: warning: variable 'rnChannels' set but not used [-Wunused-but-set-variable] 399 | int rnChannels = 0; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/init_nvtx.cc:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/nvtx.h:10: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/nvtx3/nvtx3.hpp:1940:9: warning: suggest braces around initialization of subobject [-Wmissing-braces] 1940 | 0, // payload value (union) | ^ | {} /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/nvtx3/nvtx3.hpp:1942:9: warning: suggest braces around initialization of subobject [-Wmissing-braces] 1942 | 0 // message value (union) | ^ | {} /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/enqueue.cc:506:7: warning: variable 'rnChannel' set but not used [-Wunused-but-set-variable] 506 | int rnChannel = 0; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/init_nvtx.cc:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/nvtx.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/roctx.h:18: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/enqueue.cc:610:34: warning: suggest braces around initialization of subobject [-Wmissing-braces] 610 | struct ncclWorkElemP2p elem = {0}; | ^ | {} 3 warnings generated when compiling for host. 3 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/enqueue.cc:399:7: warning: variable 'rnChannels' set but not used [-Wunused-but-set-variable] 399 | int r/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/init_nvtx.cc:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/nvtx.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/roctx.h:18: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ nChannels = 0; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/init_nvtx.cc:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/nvtx.h:10: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/nvtx3/nvtx3.hpp:1940:9: warning: suggest braces around initialization of subobject [-Wmissing-braces] 1940 | 0, // payload value (union) | ^ | {} /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/nvtx3/nvtx3.hpp:1942:9: warning: suggest braces around initialization of subobject [-Wmissing-braces] 1942 | 0 // message value (union) | ^ | {} /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/enqueue.cc:506:7: warning: variable 'rnChannel' set but not used [-Wunused-but-set-variable] 506 | int rnChannel = 0; | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/enqueue.ccIn file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/init_nvtx.cc:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/nvtx.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/roctx.h:18: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from :610:34: warning: suggest braces around initialization of subobject [-Wmissing-braces] 610 | struct ncclWorkElemP2p elem = {0}; | ^ | {} /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/init_nvtx.cc:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/nvtx.h:10: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/nvtx3/nvtx3.hpp:1940:9: warning: suggest braces around initialization of subobject [-Wmissing-braces] 1940 | 0, // payload value (union) | ^ | {} /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/nvtx3/nvtx3.hpp:1942:9: warning: suggest braces around initialization of subobject [-Wmissing-braces] 1942 | 0 // message value (union) | ^ | {} In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/init_nvtx.cc:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/nvtx.h:10: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/nvtx3/nvtx3.hpp:1940:9: warning: suggest braces around initialization of subobject [-Wmissing-braces] 1940 | 0, // payload value (union) | ^ | {} /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/nvtx3/nvtx3.hpp:1942:9: warning: suggest braces around initialization of subobject [-Wmissing-braces] 1942 | 0 // message value (union) | ^ | {} In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/init_nvtx.cc:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/nvtx.h/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/init_nvtx.cc:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/nvtx.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/roctx.h:18: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ :11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/roctx.h:18: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/init_nvtx.cc:4:42: warning: unused variable 'NvtxEnumRedSchema' [-Wunused-const-variable] 4 | static constexpr const nvtxPayloadEnum_t NvtxEnumRedSchema[] = { | ^~~~~~~~~~~~~~~~~ [ 44%] Building CXX object CMakeFiles/rccl.dir/hipify/src/net.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60300 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/src/net.cc.o -MF CMakeFiles/rccl.dir/hipify/src/net.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/net.cc.o -c /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/net.cc In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/init_nvtx.cc:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/nvtx.h:10: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/nvtx3/nvtx3.hpp:1940:9: warning: suggest braces around initialization of subobject [-Wmissing-braces] 1940 | 0, // payload value (union) | ^ | {} /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/nvtx3/nvtx3.hpp:1942:9: warning: suggest braces around initialization of subobject [-Wmissing-braces] 1942 | 0 // message value (union) | ^ | {} /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/init_nvtx.cc:4:42: warning: unused variable 'NvtxEnumRedSchema' [-Wunused-const-variable] 4 | static constexpr const nvtxPayloadEnum_t NvtxEnumRedSchema[] = { | ^~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/init_nvtx.cc:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/nvtx.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/roctx.h:18: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/init_nvtx.cc:4:42: warning: unused variable 'NvtxEnumRedSchema' [-Wunused-const-variable] 4 | static constexpr const nvtxPayloadEnum_t NvtxEnumRedSchema[] = { | ^~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/init_nvtx.cc:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/nvtx.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/roctx.h:18: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/init_nvtx.cc:4:42: warning: unused variable 'NvtxEnumRedSchema' [-Wunused-const-variable] 4 | static constexpr const nvtxPayloadEnum_t NvtxEnumRedSchema[] = { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/init_nvtx.cc:4:42: warning: unused variable 'NvtxEnumRedSchema' [-Wunused-const-variable] 4 | static constexpr const nvtxPayloadEnum_t NvtxEnumRedSchema[] = { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/init_nvtx.cc:4:42: warning: unused variable 'NvtxEnumRedSchema' [-Wunused-const-variable] 4 | static constexpr const nvtxPayloadEnum_t NvtxEnumRedSchema[] = { | ^~~~~~~~~~~~~~~~~ 4 warnings generated when compiling for host. /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/init_nvtx.cc:4:42: warning: unused variable 'NvtxEnumRedSchema' [-Wunused-const-variable] 4 | static constexpr const nvtxPayloadEnum_t NvtxEnumRedSchema[] = { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/init_nvtx.cc:4:42: warning: unused variable 'NvtxEnumRedSchema' [-Wunused-const-variable] 4 | static constexpr const nvtxPayloadEnum_t NvtxEnumRedSchema[] = { | ^~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/enqueue.cc:11: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:33:12: warning: unused function 'collNetSupport' [-Wunused-function] 33 | static int collNetSupport(struct ncclComm* comm) { return comm->ncclCollNet != nullptr ? 1 : 0; } | ^~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/enqueue.cc:12: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:211:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 211 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:222:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 222 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:233:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 233 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:245:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 245 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:258:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 258 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:268:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 268 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:279:13: warning: unused function 'isPow2' [-Wunused-function] 279 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:282:12: warning: unused function 'mirrorBits' [-Wunused-function] 282 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/enqueue.cc:15: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/gdrwrap.h:209:21: warning: unused function 'ncclGdrCudaFree' [-Wunused-function] 209 | static ncclResult_t ncclGdrCudaFree(void* gdrHandle) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/enqueue.cc:47:21: warning: unused function 'computeColl' [-Wunused-function] 47 | static ncclResult_t computeColl(struct ncclInfo* info /* input */, int* workFuncIndex, struct ncclWorkElem* work, struct ncclProxyOp* proxyOp /* output */); | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/enqueue.cc:61:21: warning: unused function 'getLoopInfo' [-Wunused-function] 61 | static ncclResult_t getLoopInfo(struct ncclInfo* collInfo); | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/enqueue.cc:794:21: warning: unused function 'getCBDCollnChannel' [-Wunused-function] 794 | static ncclResult_t getCBDCollnChannel(struct ncclKernelPlan* plan, struct ncclInfo* collInfo, int usableChannels) { | ^~~~~~~~~~~~~~~~~~ 4 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/init.cc:12: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/gdrwrap.h4:204:19: warning: unused variable 'md' [-Wunused-variable] warnings generated when compiling for gfx1100. 204 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/init.cc:12: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/gdrwrap.h:204:19: warning: unused variable 'md' [-Wunused-variable] 204 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/init.cc:12: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/gdrwrap.h:204:19: warning: unused variable 'md' [-Wunused-variable] 204 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ 4 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/init.cc:12: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/gdrwrap.h:204:19: warning: unused variable 'md' [-Wunused-variable] 204 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ 4 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/init.cc:12: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/gdrwrap.h:204:19: warning: unused variable 'md' [-Wunused-variable] 204 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/init.cc:12: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/gdrwrap.h:204:19: warning: unused variable 'md' [-Wunused-variable] 204 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ 4 warnings generated when compiling for gfx1201. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/enqueue.cc:11: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:33:12: warning: unused function 'collNetSupport' [-Wunused-function] 33 | static int collNetSupport(struct ncclComm* comm) { return comm->ncclCollNet != nullptr ? 1 : 0; } | ^~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/enqueue.cc:12: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:211:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 211 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:222:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 222 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:233:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 233 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:245:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 245 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:258:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 258 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:268:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 268 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:279:13: warning: unused function 'isPow2' [-Wunused-function] 279 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:282:12: warning: unused function 'mirrorBits' [-Wunused-function] 282 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/enqueue.cc:15: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/gdrwrap.h:209:21: warning: unused function 'ncclGdrCudaFree' [-Wunused-function] 209 | static ncclResult_t ncclGdrCudaFree(void* gdrHandle) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/enqueue.cc:47:21: warning: unused function 'computeColl' [-Wunused-function] 47 | static ncclResult_t computeColl(struct ncclInfo* info /* input */, int* workFuncIndex, struct ncclWorkElem* work, struct ncclProxyOp* proxyOp /* output */); | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/enqueue.cc:61:21: warning: unused function 'getLoopInfo' [-Wunused-function] 61 | static ncclResult_t getLoopInfo(struct ncclInfo* collInfo); | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/enqueue.cc:794:21: warning: unused function 'getCBDCollnChannel' [-Wunused-function] 794 | static ncclResult_t getCBDCollnChannel(struct ncclKernelPlan* plan, struct ncclInfo* collInfo, int usableChannels) { | ^~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/enqueue.cc:11: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:33:12: warning: unused function 'collNetSupport' [-Wunused-function] 33 | static int collNetSupport(struct ncclComm* comm) { return comm->ncclCollNet != nullptr ? 1 : 0; } | ^~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/enqueue.cc:12: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:211:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 211 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:222:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 222 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:233:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 233 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:245:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 245 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:258:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 258 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:268:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 268 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:279:13: warning: unused function 'isPow2' [-Wunused-function] 279 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:282:12: warning: unused function 'mirrorBits' [-Wunused-function] 282 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/enqueue.cc:15: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/gdrwrap.h:209:21: warning: unused function 'ncclGdrCudaFree' [-Wunused-function] 209 | static ncclResult_t ncclGdrCudaFree(void* gdrHandle) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/enqueue.cc:47:21: warning: unused function 'computeColl' [-Wunused-function] 47 | static ncclResult_t computeColl(struct ncclInfo* info /* input */, int* workFuncIndex, struct ncclWorkElem* work, struct ncclProxyOp* proxyOp /* output */); | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/enqueue.cc:61:21: warning: unused function 'getLoopInfo' [-Wunused-function] 61 | static ncclResult_t getLoopInfo(struct ncclInfo* collInfo); | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/enqueue.cc:794:21: warning: unused function 'getCBDCollnChannel' [-Wunused-function] 794 | static ncclResult_t getCBDCollnChannel(struct ncclKernelPlan* plan, struct ncclInfo* collInfo, int usableChannels) { | ^~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/enqueue.cc:11: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:33:12: warning: unused function 'collNetSupport' [-Wunused-function] 33 | static int collNetSupport(struct ncclComm* comm) { return comm->ncclCollNet != nullptr ? 1 : 0; } | ^~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/enqueue.cc:12: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:211:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 211 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:222:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 222 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:233:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 233 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:245:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 245 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:258:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 258 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:268:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 268 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:279:13: warning: unused function 'isPow2' [-Wunused-function] 279 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:282:12: warning: unused function 'mirrorBits' [-Wunused-function] 282 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/enqueue.cc:15: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/gdrwrap.h:209:21: warning: unused function 'ncclGdrCudaFree' [-Wunused-function] 209 | static ncclResult_t ncclGdrCudaFree(void* gdrHandle) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/enqueue.cc:47:21: warning: unused function 'computeColl' [-Wunused-function] 47 | static ncclResult_t computeColl(struct ncclInfo* info /* input */, int* workFuncIndex, struct ncclWorkElem* work, struct ncclProxyOp* proxyOp /* output */); | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/enqueue.cc:61:21: warning: unused function 'getLoopInfo' [-Wunused-function] 61 | static ncclResult_t getLoopInfo(struct ncclInfo* collInfo); | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/enqueue.cc:794:21: warning: unused function 'getCBDCollnChannel' [-Wunused-function] 794 | static ncclResult_t getCBDCollnChannel(struct ncclKernelPlan* plan, struct ncclInfo* collInfo, int usableChannels) { | ^~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/enqueue.cc:11: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:33:12: warning: unused function 'collNetSupport' [-Wunused-function] 33 | static int collNetSupport(struct ncclComm* comm) { return comm->ncclCollNet != nullptr ? 1 : 0; } | ^~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/enqueue.cc:12: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:211:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 211 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:222:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 222 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:233:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 233 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:245:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 245 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:258:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 258 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:268:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 268 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:279:13: warning: unused function 'isPow2' [-Wunused-function] 279 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:282:12: warning: unused function 'mirrorBits' [-Wunused-function] 282 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/enqueue.cc:15: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/gdrwrap.h:209:21: warning: unused function 'ncclGdrCudaFree' [-Wunused-function] 209 | static ncclResult_t ncclGdrCudaFree(void* gdrHandle) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/enqueue.cc:47:21: warning: unused function 'computeColl' [-Wunused-function] 47 | static ncclResult_t computeColl(struct ncclInfo* info /* input */, int* workFuncIndex, struct ncclWorkElem* work, struct ncclProxyOp* proxyOp /* output */); | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/enqueue.cc:61:21: warning: unused function 'getLoopInfo' [-Wunused-function] 61 | static ncclResult_t getLoopInfo(struct ncclInfo* collInfo); | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/enqueue.cc:794:21: warning: unused function 'getCBDCollnChannel' [-Wunused-function] 794 | static ncclResult_t getCBDCollnChannel(struct ncclKernelPlan* plan, struct ncclInfo* collInfo, int usableChannels) { | ^~~~~~~~~~~~~~~~~~ In file included from 4 warnings generated when compiling for gfx1101. /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/enqueue.cc:11: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:16:20:In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/init.cc:12: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/gdrwrap.h:204:19: warning: unused variable 'md' [-Wunused-variable] 204 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:33:12: warning: unused function 'collNetSupport' [-Wunused-function] 33 | static int collNetSupport(struct ncclComm* comm) { return comm->ncclCollNet != nullptr ? 1 : 0; } | ^~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/enqueue.cc:12: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:211:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 211 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:222:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 222 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:233:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 233 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:245:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 245 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:258:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 258 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:268:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 268 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:279:13: warning: unused function 'isPow2' [-Wunused-function] 279 | static boolIn file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/init.cc:12: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/gdrwrap.h:204:19: warning: unused variable 'md' [-Wunused-variable] 204 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:282:12: warning: unused function 'mirrorBits' [-Wunused-function] 282 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/enqueue.cc:15: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/gdrwrap.h:209:21: warning: unused function 'ncclGdrCudaFree' [-Wunused-function] 209 | static ncclResult_t ncclGdrCudaFree(void* gdrHandle) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/enqueue.cc:47:21: warning: unused function 'computeColl' [-Wunused-function] 47 | static ncclResult_t computeColl(struct ncclInfo* info /* input */, int* workFuncIndex, struct ncclWorkElem* work, struct ncclProxyOp* proxyOp /* output */); | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/enqueue.cc:61:21: warning: unused function 'getLoopInfo' [-Wunused-function] 61 | static ncclResult_t getLoopInfo(struct ncclInfo* collInfo); | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/enqueue.cc:794:21: warning: unused function 'getCBDCollnChannel' [-Wunused-function] 794 | static ncclResult_t getCBDCollnChannel(struct ncclKernelPlan* plan, struct ncclInfo* collInfo, int usableChannels) { | ^~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/enqueue.cc:11: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:33:12: warning: unused function 'collNetSupport' [-Wunused-function] 33 | static int collNetSupport(struct ncclComm* comm) { return comm->ncclCollNet != nullptr ? 1 : 0; } | ^~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/enqueue.cc:12: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:211:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 211 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:222:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 222 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:233:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 233 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:245:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 245 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:258:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 258 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:268:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 268 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:279:13: warning: unused function 'isPow2' [-Wunused-function] 279 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:282:12: warning: unused function 'mirrorBits' [-Wunused-function] 282 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/enqueue.cc:15: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/gdrwrap.h:209:21: warning: unused function 'ncclGdrCudaFree' [-Wunused-function] 209 | static ncclResult_t ncclGdrCudaFree(void* gdrHandle) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/enqueue.cc:47:21: warning: unused function 'computeColl' [-Wunused-function] 47 | static ncclResult_t computeColl(struct ncclInfo* info /* input */, int* workFuncIndex, struct ncclWorkElem* work, struct ncclProxyOp* proxyOp /* output */); | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/enqueue.cc:61:21: warning: unused function 'getLoopInfo' [-Wunused-function] 61 | static ncclResult_t getLoopInfo(struct ncclInfo* collInfo); | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/enqueue.cc:794:21: warning: unused function 'getCBDCollnChannel' [-Wunused-function] 794 | static ncclResult_t getCBDCollnChannel(struct ncclKernelPlan* plan, struct ncclInfo* collInfo, int usableChannels) { | ^~~~~~~~~~~~~~~~~~ 4 warnings generated when compiling for gfx1200. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/enqueue.cc:11: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:33:12: warning: unused function 'collNetSupport' [-Wunused-function] 33 | static int collNetSupport(struct ncclComm* comm) { return comm->ncclCollNet != nullptr ? 1 : 0; } | ^~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/enqueue.cc:12: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:211:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 211 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:222:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 222 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:233:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 233 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:245:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 245 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:258:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 258 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:268:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 268 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:279:13: warning: unused function 'isPow2' [-Wunused-function] 279 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:282:12: warning: unused function 'mirrorBits' [-Wunused-function] 282 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/enqueue.cc:15: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/gdrwrap.h:209:21: warning: unused function 'ncclGdrCudaFree' [-Wunused-function] 209 | static ncclResult_t ncclGdrCudaFree(void* gdrHandle) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/enqueue.cc:47:21: warning: unused function 'computeColl' [-Wunused-function] 47 | static ncclResult_t computeColl(struct ncclInfo* info /* input */, int* workFuncIndex, struct ncclWorkElem* work, struct ncclProxyOp* proxyOp /* output */); | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/enqueue.cc:61:21: warning: unused function 'getLoopInfo' [-Wunused-function] 61 | static ncclResult_t getLoopInfo(struct ncclInfo* collInfo); | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/enqueue.cc:794:21: warning: unused function 'getCBDCollnChannel' [-Wunused-function] 794 | static ncclResult_t getCBDCollnChannel(struct ncclKernelPlan* plan, struct ncclInfo* collInfo, int usableChannels) { | ^~~~~~~~~~~~~~~~~~ [ 45%] Building CXX object CMakeFiles/rccl.dir/hipify/src/msccl.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60300 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/src/msccl.cc.o -MF CMakeFiles/rccl.dir/hipify/src/msccl.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/msccl.cc.o -c /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/msccl.cc 37 warnings generated when compiling for gfx1102. 37 warnings generated when compiling for gfx1100. 37 warnings generated when compiling for gfx1201. 37 warnings generated when compiling for gfx1200. 37 warnings generated when compiling for gfx90a. 37 warnings generated when compiling for gfx1101. /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/init.cc:1967:11: warning: unused variable 'stackSize' [-Wunused-variable] 1967 | int64_t stackSize; | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/init.cc:1968:19: warning: unused variable 'devProp' [-Wunused-variable] 1968 | hipDeviceProp_t devProp; | ^~~~~~~ 37 warnings generated when compiling for gfx90a. /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/init.cc:1967:11: warning: unused variable 'stackSize' [-Wunused-variable] 1967 | int64_t stackSize; | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/init.cc:1968:19: warning: unused variable 'devProp' [-Wunused-variable] 1968 | hipDeviceProp_t devProp; | ^~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/init.cc:2357:26: warning: unused variable 'payload' [-Wunused-variable] 2357 | NvtxParamsCommInitRank payload{myrank, nranks, cudaDev}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/init.cc:1967:11: warning: unused variable 'stackSize' [-Wunused-variable] 1967 | int64_t stackSize; | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/init.cc:1968:19: warning: unused variable 'devProp' [-Wunused-variable] 1968 | hipDeviceProp_t devProp; | ^~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/init.cc:2371:38: warning: unused variable 'CommInitAllSchema' [-Wunused-variable] 2371 | constexpr nvtxPayloadSchemaEntry_t CommInitAllSchema[] = { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/init.cc:2357:26: warning: unused variable 'payload' [-Wunused-variable] 2357 | NvtxParamsCommInitRank payload{myrank, nranks, cudaDev}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/init.cc:2371:38: warning: unused variable 'CommInitAllSchema' [-Wunused-variable] 2371 | constexpr nvtxPayloadSchemaEntry_t CommInitAllSchema[] = { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/init.cc:2357:26: warning: unused variable 'payload' [-Wunused-variable] 2357 | NvtxParamsCommInitRank payload{myrank, nranks, cudaDev}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/init.cc:2371:38: warning: unused variable 'CommInitAllSchema' [-Wunused-variable] 2371 | constexpr nvtxPayloadSchemaEntry_t CommInitAllSchema[] = { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/init.cc:2701:26: warning: unused variable 'payload' [-Wunused-variable] 2701 | NvtxParamsCommInitRank payload{rank, nranks, cudaDev}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/init.cc:2731:26: warning: unused variable 'payload' [-Wunused-variable] 2731 | NvtxParamsCommInitRank payload{rank, nranks, cudaDev}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/init.cc:1967:11: warning: unused variable 'stackSize' [-Wunused-variable] 1967 | int64_t stackSize; | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/init.cc:1968:19: warning: unused variable 'devProp' [-Wunused-variable] 1968 | hipDeviceProp_t devProp; | ^~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/init.cc:2701:26: warning: unused variable 'payload' [-Wunused-variable] 2701 | NvtxParamsCommInitRank payload{rank, nranks, cudaDev}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/init.cc:2731:26: warning: unused variable 'payload' [-Wunused-variable] 2731 | NvtxParamsCommInitRank payload{rank, nranks, cudaDev}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/init.cc:2701:26: warning: unused variable 'payload' [-Wunused-variable] 2701 | NvtxParamsCommInitRank payload{rank, nranks, cudaDev}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/init.cc:2731:26: warning: unused variable 'payload' [-Wunused-variable] 2731 | NvtxParamsCommInitRank payload{rank, nranks, cudaDev}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/init.cc:2357:26: warning: unused variable 'payload' [-Wunused-variable] 2357 | NvtxParamsCommInitRank payload{myrank, nranks, cudaDev}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/init.cc:2371:38: warning: unused variable 'CommInitAllSchema' [-Wunused-variable] 2371 | constexpr nvtxPayloadSchemaEntry_t CommInitAllSchema[] = { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/init.cc:1967:11: warning: unused variable 'stackSize' [-Wunused-variable] 1967 | int64_t stackSize; | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/init.cc:1968:19: warning: unused variable 'devProp' [-Wunused-variable] 1968 | hipDeviceProp_t devProp; | ^~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/init.cc:2701:26: warning: unused variable 'payload' [-Wunused-variable] 2701 | NvtxParamsCommInitRank payload{rank, nranks, cudaDev}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/init.cc:2731:26: warning: unused variable 'payload' [-Wunused-variable] 2731 | NvtxParamsCommInitRank payload{rank, nranks, cudaDev}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/init.cc:1967:11: warning: unused variable 'stackSize' [-Wunused-variable] 1967 | int64_t stackSize; | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/init.cc:1968:19: warning: unused variable 'devProp' [-Wunused-variable] 1968 | hipDeviceProp_t devProp; | ^~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/init.cc:2357:26: warning: unused variable 'payload' [-Wunused-variable] 2357 | NvtxParamsCommInitRank payload{myrank, nranks, cudaDev}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/init.cc:2371:38: warning: unused variable 'CommInitAllSchema' [-Wunused-variable] 2371 | constexpr nvtxPayloadSchemaEntry_t CommInitAllSchema[] = { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/init.cc:1967:11: warning: unused variable 'stackSize' [-Wunused-variable] 1967 | int64_t stackSize; | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/init.cc:1968:19: warning: unused variable 'devProp' [-Wunused-variable] 1968 | hipDeviceProp_t devProp; | ^~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/init.cc:2357:26: warning: unused variable 'payload' [-Wunused-variable] 2357 | NvtxParamsCommInitRank payload{myrank, nranks, cudaDev}; | ^~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/init.cc:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/init.cc:17: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t col/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/init.cc:2371:38: warning: unused variable 'CommInitAllSchema' [-Wunused-variable] 2371 | constexpr nvtxPayloadSchemaEntry_t CommInitAllSchema[] = { | ^~~~~~~~~~~~~~~~~ lNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/init.cc:37: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:222:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 222 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:233:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 233 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:245:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 245 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:258:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 258 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:268:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 268 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:279:13: warning: unused function 'isPow2' [-Wunused-function] 279 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:282:12: warning: unused function 'mirrorBits' [-Wunused-function] 282 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/init.cc:38: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:75:21: warning: unused function 'xmlAlloc' [-Wunused-function] 75 | static ncclResult_t xmlAlloc(struct ncclXml** xml, int maxNodes) { | ^~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:110:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 110 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:117:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 117 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:124:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 124 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:132:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 132 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:139:21: warning: unused function 'xmlFindTag' [-Wunused-function] 139 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:151:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 151 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:163:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 163 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:179:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 179 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:192:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 192 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:204:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 204 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:217:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 217 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:230:21: warning: unused function 'xmlSetAttrLong' [-Wunused-function] 230 | static ncclResult_t xmlSetAttrLong(struct ncclXmlNode* node, const char* attrName, const int64_t value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:243:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 243 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:255:21: warning: unused function 'xmlGetSub' [-Wunused-function] 255 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:281:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 281 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:288:21: warning: unused function 'xmlAddNode' [-Wunused-function] 288 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:310:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 310 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:323:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 323 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:353:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 353 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:366:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 366 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/init.cc:906:21: warning: unused function 'collNetTrySetup' [-Wunused-function] 906 | static ncclResult_t collNetTrySetup(ncclComm_t comm, ncclComm_t parent, struct ncclTopoGraph* collNetGraph) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/init.cc:2342:36: warning: unused variable 'CommInitRankSchema' [-Wunused-const-variable] 2342 | constexpr nvtxPayloadSchemaEntry_t CommInitRankSchema[] = { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/init.cc:2701:26: warning: unused variable 'payload' [-Wunused-variable] 2701 | NvtxParamsCommInitRank payload/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/init.cc{rank, nranks, cudaDev};:2357 :26: warning: unused variable 'payload' [-Wunused-variable] | 2357 | Nvt ^~~~~~~xPar amsCommInitRank payload{myrank, nranks, cudaDev}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/init.cc:2731:26: warning: unused variable 'payload' [-Wunused-variable] 2731 | NvtxParamsCommInitRank payload{rank, nranks, cudaDev}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/init.ccIn file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/init.cc:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/init.cc:17: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->:2371:38: warning: unused variable 'CommInitAllSchema' [-Wunused-variable] 2371 | constexpr nvtxPayloadSchemaEntry_t CommInitAllSchema[] = { | ^~~~~~~~~~~~~~~~~ closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/init.cc:37: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:222:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 222 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:233:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 233 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:245:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 245 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:258:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 258 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:268:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 268 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:279:13: warning: unused function 'isPow2' [-Wunused-function] 279 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:282:12: warning: unused function 'mirrorBits' [-Wunused-function] 282 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/init.cc:38: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:75:21: warning: unused function 'xmlAlloc' [-Wunused-function] 75 | static ncclResult_t xmlAlloc(struct ncclXml** xml, int maxNodes) { | ^~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:110:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 110 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:117:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 117 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:124:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 124 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:132:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 132 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:139:21: warning: unused function 'xmlFindTag' [-Wunused-function] 139 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:151:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 151 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:163:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 163 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:179:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 179 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:192:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 192 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:204:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 204 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:217:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 217 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:230:21: warning: unused function 'xmlSetAttrLong' [-Wunused-function] 230 | static ncclResult_t xmlSetAttrLong(struct ncclXmlNode* node, const char* attrName, const int64_t value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:243:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 243 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:255:21: warning: unused function 'xmlGetSub' [-Wunused-function] 255 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:281:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 281 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:288:21: warning: unused function 'xmlAddNode' [-Wunused-function] 288 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:310:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 310 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:323:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 323 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:353:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 353 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:366:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 366 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/init.cc:906:21: warning: unused function 'collNetTrySetup' [-Wunused-function] 906 | static ncclResult_t collNetTrySetup(ncclComm_t comm, ncclComm_t parent, struct ncclTopoGraph* collNetGraph) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/init.cc:2342:36: warning: unused variable 'CommInitRankSchema' [-Wunused-const-variable] 2342 | constexpr nvtxPayloadSchemaEntry_t CommInitRankSchema[] = { | ^~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/init.cc:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/init.cc:17: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/init.cc:37: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:222:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 222 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:233:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 233 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:245:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 245 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:258:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 258 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:268:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 268 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:279:13: warning: unused function 'isPow2' [-Wunused-function] 279 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:282:12: warning: unused function 'mirrorBits' [-Wunused-function] 282 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/init.cc:38: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:75:21: warning: unused function 'xmlAlloc' [-Wunused-function] 75 | static ncclResult_t xmlAlloc(struct ncclXml** xml, int maxNodes) { | ^~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:110:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 110 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:117:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 117 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:124:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 124 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:132:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 132 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:139:21: warning: unused function 'xmlFindTag' [-Wunused-function] 139 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:151:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 151 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:163:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 163 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:179:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 179 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:192:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 192 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:204:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 204 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:217:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 217 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:230:21: warning: unused function 'xmlSetAttrLong' [-Wunused-function] 230 | static ncclResult_t xmlSetAttrLong(struct ncclXmlNode* node, const char* attrName, const int64_t value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:243:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 243 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:255:21: warning: unused function 'xmlGetSub' [-Wunused-function] 255 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:281:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 281 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:288:21: warning: unused function 'xmlAddNode' [-Wunused-function] 288 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:310:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 310 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:323:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 323 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:353:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 353 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:366:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 366 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/init.cc:906:21: warning: unused function 'collNetTrySetup' [-Wunused-function] 906 | static ncclResult_t collNetTrySetup(ncclComm_t comm, ncclComm_t parent, struct ncclTopoGraph* collNetGraph) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/init.cc:2342:36: warning: unused variable 'CommInitRankSchema' [-Wunused-const-variable] 2342 | constexpr nvtxPayloadSchemaEntry_t CommInitRankSchema[] = { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/init.cc:2701:26: warning: unused variable 'payload' [-Wunused-variable] 2701 | NvtxParamsCommInitRank payload{rank, nranks, cudaDev}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/init.cc:2731:26: warning: unused variable 'payload' [-Wunused-variable] 2731 | NvtxParamsCommInitRank payload{rank, nranks, cudaDev}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/init.cc:2701:26: warning: unused variable 'payload' [-Wunused-variable] 2701 | NvtxParamsCommInitRank payload{rank, nranks, cudaDev}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/init.cc:2731:26: warning: unused variable 'payload' [-Wunused-variable] 2731 | NvtxParamsCommInitRank payload{rank, nranks, cudaDev}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/init.cc:1967:11: warning: unused variable 'stackSize' [-Wunused-variable] 1967 | int64_t stackSize; | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/init.cc:1968:19: warning: unused variable 'devProp' [-Wunused-variable] 1968 | hipDeviceProp_t devProp; | ^~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/init.cc:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/init.cc:17: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/init.cc:37: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:222:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 222 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:233:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 233 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:245:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 245 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:258:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 258 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:268:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 268 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:279:13: warning: unused function 'isPow2' [-Wunused-function] 279 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:282:12: warning: unused function 'mirrorBits' [-Wunused-function] 282 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/init.cc:38: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:75:21: warning: unused function 'xmlAlloc' [-Wunused-function] 75 | static ncclResult_t xmlAlloc(struct ncclXml** xml, int maxNodes) { | ^~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:110:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 110 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:117:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 117 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:124:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 124 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:132:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 132 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:139:21: warning: unused function 'xmlFindTag' [-Wunused-function] 139 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:151:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 151 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:163:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 163 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:179:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 179 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:192:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 192 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:204:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 204 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:217:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 217 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:230:21: warning: unused function 'xmlSetAttrLong' [-Wunused-function] 230 | static ncclResult_t xmlSetAttrLong(struct ncclXmlNode* node, const char* attrName, const int64_t value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:243:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 243 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:255:21: warning: unused function 'xmlGetSub' [-Wunused-function] 255 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:281:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 281 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:288:21: warning: unused function 'xmlAddNode' [-Wunused-function] 288 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:310:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 310 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:323:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 323 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:353:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 353 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:366:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 366 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/init.cc:906:21: warning: unused function 'collNetTrySetup' [-Wunused-function] 906 | static ncclResult_t collNetTrySetup(ncclComm_t comm, ncclComm_t parent, struct ncclTopoGraph* collNetGraph) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/init.cc:2342:36: warning: unused variable 'CommInitRankSchema' [-Wunused-const-variable] 2342 | constexpr nvtxPayloadSchemaEntry_t CommInitRankSchema[] = { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/init.cc:2357:26: warning: unused variable 'payload' [-Wunused-variable] 2357 | NvtxParamsCommInitRank payload{myrank, nranks, cudaDev}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/init.cc:2371:38: warning: unused variable 'CommInitAllSchema' [-Wunused-variable] 2371 | constexpr nvtxPayloadSchemaEntry_t CommInitAllSchema[] = { | ^~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/init.cc:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/init.cc:17: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/init.cc:37: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:222:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 222 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:233:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 233 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:245:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 245 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:258:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 258 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:268:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 268 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:279:13: warning: unused function 'isPow2' [-Wunused-function] 279 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:282:12: warning: unused function 'mirrorBits' [-Wunused-function] 282 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/init.cc:38: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:75:21: warning: unused function 'xmlAlloc' [-Wunused-function] 75 | static ncclResult_t xmlAlloc(struct ncclXml** xml, int maxNodes) { | ^~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:110:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 110 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:117:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 117 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:124:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 124 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:132:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 132 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:139:21: warning: unused function 'xmlFindTag' [-Wunused-function] 139 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:151:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 151 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:163:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 163 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:179:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 179 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:192:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 192 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:204:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 204 | static ncclRes/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/init.cc:2701:26: warning: unused variable 'payload' [-Wunused-variable] 2701 | NvtxParamsCommInitRank payload{rank, nranks, cudaDev}; | ^~~~~~~ ult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:217:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 217 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:230:21: warning: unused function 'xmlSetAttrLong' [-Wunused-function] 230 | static ncclResult_t xmlSetAttrLong(struct ncclXmlNode* node, const char* attrName, const int64_t value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:243:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 243 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:255:21: warning: unused function 'xmlGetSub' [-Wunused-function] 255 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:281:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 281 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:288:21: warning: unused function 'xmlAddNode' [-Wunused-function] 288 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:310:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 310 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:323:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 323 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:353:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 353 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:366:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 366 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/init.cc:906:21: warning: unused function 'collNetTrySetup' [-Wunused-function] 906 | static ncclResult_t collNetTrySetup(ncclComm_t comm, ncclComm_t parent, struct ncclTopoGraph* collNetGraph) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/init.cc:2342:36: warning: unused variable 'CommInitRankSchema' [-Wunused-const-variable] 2342 | constexpr nvtxPayloadSchemaEntry_t CommInitRankSchema[] = { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/init.cc:2731:26: warning: unused variable 'payload' [-Wunused-variable] 2731 | NvtxParamsCommInitRank payload{rank, nranks, cudaDev}; | ^~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/init.cc:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/init.cc:17: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/init.cc:37: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:222:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 222 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:233:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 233 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:245:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 245 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:258:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 258 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:268:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 268 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:279:13: warning: unused function 'isPow2' [-Wunused-function] 279 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:282:12: warning: unused function 'mirrorBits' [-Wunused-function] 282 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/init.cc:38: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:75:21: warning: unused function 'xmlAlloc' [-Wunused-function] 75 | static ncclResult_t xmlAlloc(struct ncclXml** xml, int maxNodes) { | ^~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:110:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 110 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:117:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 117 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:124:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 124 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:132:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 132 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, floaIn file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/init.cc:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/init.cc:17: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/init.cc:37: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:222:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 222 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:233:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 233 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:245:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 245 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:258:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 258 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:268:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 268 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:279:13: warning: unused function 'isPow2' [-Wunused-function] 279 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:282:12: warning: unused function 'mirrorBits' [-Wunused-function] 282 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/init.cc:38: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:75:21: warning: unused function 'xmlAlloc' [-Wunused-function] 75 | static ncclResult_t xmlAlloc(struct ncclXml** xml, int maxNodes) { | ^~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:110:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 110 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:117:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 117 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:124:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 124 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:132:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 132 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:139:21: warning: unused function 'xmlFindTag' [-Wunused-function] 139 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:151:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 151 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:163:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 163 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:179:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 179 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:192:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 192 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:204:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 204 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:217:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 217 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:230:21: warning: unused function 'xmlSetAttrLong' [-Wunused-function] 230 | static ncclResult_t xmlSetAttrLong(struct ncclXmlNode* node, const char* attrName, const int64_t value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:243:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 243 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:255:21: warning: unused function 'xmlGetSub' [-Wunused-function] 255 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:281:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 281 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:288:21: warning: unused function 'xmlAddNode' [-Wunused-function] 288 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:310:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 310 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:323:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 323 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:353:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 353 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:366:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 366 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/init.cc:906:21: warning: unused function 'collNetTrySetup' [-Wunused-function] 906 | static ncclResult_t collNetTrySetup(ncclComm_t comm, ncclComm_t parent, struct ncclTopoGraph* collNetGraph) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/init.cc:2342:36: warning: unused variable 'CommInitRankSchema' [-Wunused-const-variable] 2342 | constexpr nvtxPayloadSchemaEntry_t CommInitRankSchema[] = { | ^~~~~~~~~~~~~~~~~~ t* value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:139:21: warning: unused function 'xmlFindTag' [-Wunused-function] 139 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:151:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 151 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:163:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 163 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:179:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 179 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:192:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 192 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:204:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 204 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:217:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 217 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:230:21: warning: unused function 'xmlSetAttrLong' [-Wunused-function] 230 | static ncclResult_t xmlSetAttrLong(struct ncclXmlNode* node, const char* attrName, const int64_t value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:243:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 243 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:255:21: warning: unused function 'xmlGetSub' [-Wunused-function] 255 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:281:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 281 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:288:21: warning: unused function 'xmlAddNode' [-Wunused-function] 288 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:310:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 310 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:323:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 323 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:353:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 353 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:366:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 366 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/init.cc:906:21: warning: unused function 'collNetTrySetup' [-Wunused-function] 906 | static ncclResult_t collNetTrySetup(ncclComm_t comm, ncclComm_t parent, struct ncclTopoGraph* collNetGraph) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/init.cc:2342:36: warning: unused variable 'CommInitRankSchema' [-Wunused-const-variable] 2342 | constexpr nvtxPayloadSchemaEntry_t CommInitRankSchema[] = { | ^~~~~~~~~~~~~~~~~~ 55 warnings generated when compiling for gfx1201. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/init.cc:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/init.cc:17: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/init.cc:37: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:222:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 222 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:233:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 233 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:245:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 245 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:258:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 258 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:268:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 268 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:279:13: warning: unused function 'isPow2' [-Wunused-function] 279 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:282:12: warning: unused function 'mirrorBits' [-Wunused-function] 282 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/init.cc:38: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:75:21: warning: unused function 'xmlAlloc' [-Wunused-function] 75 | static ncclResult_t xmlAlloc(struct ncclXml** xml, int maxNodes) { | ^~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:110:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 110 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:117:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 117 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:124:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 124 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:132:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 132 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:139:21: warning: unused function 'xmlFindTag' [-Wunused-function] 139 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:151:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 151 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:163:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 163 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:179:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 179 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:192:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 192 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:204:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 204 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:217:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 217 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:230:21: warning: unused function 'xmlSetAttrLong' [-Wunused-function] 230 | static ncclResult_t xmlSetAttrLong(struct ncclXmlNode* node, const char* attrName, const int64_t value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:243:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 243 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:255:21: warning: unused function 'xmlGetSub' [-Wunused-function] 255 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:281:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 281 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:288:21: warning: unused function 'xmlAddNode' [-Wunused-function] 288 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:310:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 310 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:323:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 323 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:353:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 353 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:366:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 366 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/init.cc:906:21: warning: unused function 'collNetTrySetup' [-Wunused-function] 906 | static ncclResult_t collNetTrySetup(ncclComm_t comm, ncclComm_t parent, struct ncclTopoGraph* collNetGraph) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/init.cc:2342:36: warning: unused variable 'CommInitRankSchema' [-Wunused-const-variable] 2342 | constexpr nvtxPayloadSchemaEntry_t CommInitRankSchema[] = { | ^~~~~~~~~~~~~~~~~~ 55 warnings generated when compiling for gfx90a. 55 warnings generated when compiling for gfx1100. 55 warnings generated when compiling for gfx1101. 55 warnings generated when compiling for gfx90a. 55 warnings generated when compiling for gfx1102. 55 warnings generated when compiling for gfx1200. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/net.cc:8: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/net.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/net.cc:8: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/net.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/net.cc:8: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/net.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/net.cc:8: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/net.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/net.cc:8: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/net.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/net.cc:8: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/net.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/net.cc:8: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/net.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/net.cc:8: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/net.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/net.cc:8: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/net.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/net.cc:8: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/net.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/net.cc:8: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/net.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/net.cc:8: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/net.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/net.cc:8: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/net.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/net.cc:8: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/net.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/net.cc:8: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/net.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/net.cc:8: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/net.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/msccl.cc:6: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/enqueue.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/msccl.cc:6: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/enqueue.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/msccl.cc:6: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/enqueue.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/msccl.cc:6: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/enqueue.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ 2 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/msccl.cc:6: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/enqueue.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ 2 warnings generated when compiling for gfx1201. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/msccl.cc:6: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/enqueue.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/msccl.cc:6: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/enqueue.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/msccl.cc:6: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/enqueue.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ 2 warnings generated when compiling for gfx1100. 2 warnings generated when compiling for gfx1200. 2 warnings generated when compiling for gfx1102. 2 warnings generated when compiling for gfx1101. 2 warnings generated when compiling for gfx90a. 2 warnings generated when compiling for host. [ 45%] Building CXX object CMakeFiles/rccl.dir/hipify/src/proxy.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60300 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/src/proxy.cc.o -MF CMakeFiles/rccl.dir/hipify/src/proxy.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/proxy.cc.o -c /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/proxy.cc /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/msccl.cc:53:38: warning: unused variable 'MscclSchema' [-Wunused-variable] 53 | constexpr nvtxPayloadSchemaEntry_t MscclSchema[] = { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/msccl.cc:57:19: warning: unused variable 'payload' [-Wunused-variable] 57 | NvtxParamsMsccl payload{sendCounts[comm->rank] * ncclTypeSize(dataType), recvCounts[comm->rank] * ncclTypeSize(dataType)}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/msccl.cc:53:38: warning: unused variable 'MscclSchema' [-Wunused-variable] 53 | constexpr nvtxPayloadSchemaEntry_t MscclSchema[] = { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/msccl.cc:57:19: warning: unused variable 'payload' [-Wunused-variable] 57 | NvtxParamsMsccl payload{sendCounts[comm->rank] * ncclTypeSize(dataType), recvCounts[comm->rank] * ncclTypeSize(dataType)}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/msccl.cc:53:38: warning: unused variable 'MscclSchema' [-Wunused-variable] 53 | constexpr nvtxPayloadSchemaEntry_t MscclSchema[] = { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/msccl.cc:57:19: warning: unused variable 'payload' [-Wunused-variable] 57 | NvtxParamsMsccl payload{sendCounts[comm->rank] * ncclTypeSize(dataType), recvCounts[comm->rank] * ncclTypeSize(dataType)}; | ^~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/msccl.cc:6: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/enqueue.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/msccl.cc:7: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:75:21: warning: unused function 'mscclXmlGetAttrInt' [-Wunused-function] 75 | static ncclResult_t mscclXmlGetAttrInt(struct mscclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:82:21: warning: unused function 'mscclXmlGetAttrInt64' [-Wunused-function] 82 | static ncclResult_t mscclXmlGetAttrInt64(struct mscclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:89:21: warning: unused function 'mscclXmlFindTag' [-Wunused-function] 89 | static ncclResult_t mscclXmlFindTag(struct mscclXml* xml, const char* tagName, struct mscclXmlNode** node) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/msccl.cc:53:38: warning: unused variable 'MscclSchema' [-Wunused-variable] 53 | constexpr nvtxPayloadSchemaEntry_t MscclSchema[] = { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/msccl.cc:57:19: warning: unused variable 'payload' [-Wunused-variable] 57 | NvtxParamsMsccl payload{sendCounts[comm->rank] * ncclTypeSize(dataType), recvCounts[comm->rank] * ncclTypeSize(dataType)}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/msccl.cc:53:38: warning: unused variable 'MscclSchema' [-Wunused-variable] 53 | constexpr nvtxPayloadSchemaEntry_t MscclSchema[] = { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/msccl.cc:57:19: warning: unused variable 'payload' [-Wunused-variable] 57 | NvtxParamsMsccl payload{sendCounts[comm->rank] * ncclTypeSize(dataType), recvCounts[comm->rank] * ncclTypeSize(dataType)}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/msccl.cc:53:38: warning: unused variable 'MscclSchema' [-Wunused-variable] 53 | constexpr nvtxPayloadSchemaEntry_t MscclSchema[] = { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/msccl.cc:57:19: warning: unused variable 'payload' [-Wunused-variable] 57 | NvtxParamsMsccl payload{sendCounts[comm->rank] * ncclTypeSize(dataType), recvCounts[comm->rank] * ncclTypeSize(dataType)}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/msccl.cc:53:38: warning: unused variable 'MscclSchema' [-Wunused-variable] 53 | constexpr nvtxPayloadSchemaEntry_t MscclSchema[] = { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/msccl.cc:57:19: warning: unused variable 'payload' [-Wunused-variable] 57 | NvtxParamsMsccl payload{sendCounts[comm->rank] * ncclTypeSize(dataType), recvCounts[comm->rank] * ncclTypeSize(dataType)}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/msccl.cc:53:38: warning: unused variable 'MscclSchema' [-Wunused-variable] 53 | constexpr nvtxPayloadSchemaEntry_t MscclSchema[] = { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/msccl.cc:57:19: warning: unused variable 'payload' [-Wunused-variable] 57 | NvtxParamsMsccl payload{sendCounts[comm->rank] * ncclTypeSize(dataType), recvCounts[comm->rank] * ncclTypeSize(dataType)}; | ^~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/msccl.cc:6: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/enqueue.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/msccl.cc:7: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:75:21: warning: unused function 'mscclXmlGetAttrInt' [-Wunused-function] 75 | static ncclResult_t mscclXmlGetAttrInt(struct mscclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:82:21: warning: unused function 'mscclXmlGetAttrInt64' [-Wunused-function] 82 | static ncclResult_t mscclXmlGetAttrInt64(struct mscclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:89:21: warning: unused function 'mscclXmlFindTag' [-Wunused-function] 89 | static ncclResult_t mscclXmlFindTag(struct mscclXml* xml, const char* tagName, struct mscclXmlNode** node) { | ^~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/msccl.cc:6: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/enqueue.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/msccl.cc:7: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:75:21: warning: unused function 'mscclXmlGetAttrInt' [-Wunused-function] 75 | static ncclResult_t mscclXmlGetAttrInt(struct mscclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:82:21: warning: unused function 'mscclXmlGetAttrInt64' [-Wunused-function] 82 | static ncclResult_t mscclXmlGetAttrInt64(struct mscclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:89:21: warning: unused function 'mscclXmlFindTag' [-Wunused-function] 89 | static ncclResult_t mscclXmlFindTag(struct mscclXml* xml, const char* tagName, struct mscclXmlNode** node) { | ^~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/msccl.cc:6: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/enqueue.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/msccl.cc:7: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:75:21: warning: unused function 'mscclXmlGetAttrInt' [-Wunused-function] 75 | static ncclResult_t mscclXmlGetAttrInt(struct mscclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:82:21: warning: unused function 'mscclXmlGetAttrInt64' [-Wunused-function] 82 | static ncclResult_t mscclXmlGetAttrInt64(struct mscclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:89:21: warning: unused function 'mscclXmlFindTag' [-Wunused-function] 89 | static ncclResult_t mscclXmlFindTag(struct mscclXml* xml, const char* tagName, struct mscclXmlNode** node) { | ^~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/msccl.cc:6: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/enqueue.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/msccl.cc:7: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:75:21: warning: unused function 'mscclXmlGetAttrInt' [-Wunused-function] 75 | static ncclResult_t mscclXmlGetAttrInt(struct mscclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:82:21: warning: unused function 'mscclXmlGetAttrInt64' [-Wunused-function] 82 | static ncclResult_t mscclXmlGetAttrInt64(struct mscclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:89:21: warning: unused function 'mscclXmlFindTag' [-Wunused-function] 89 | static ncclResult_t mscclXmlFindTag(struct mscclXml* xml, const char* tagName, struct mscclXmlNode** node) { | ^~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/msccl.cc:6: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/enqueue.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/msccl.cc:7: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:75:21: warning: unused function 'mscclXmlGetAttrInt' [-Wunused-function] 75 | static ncclResult_t mscclXmlGetAttrInt(struct mscclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:82:21: warning: unused function 'mscclXmlGetAttrInt64' [-Wunused-function] 82 | static ncclResult_t mscclXmlGetAttrInt64(struct mscclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:89:21: warning: unused function 'mscclXmlFindTag' [-Wunused-function] 89 | static ncclResult_t mscclXmlFindTag(struct mscclXml* xml, const char* tagName, struct mscclXmlNode** node) { | ^~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/msccl.cc:6: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/enqueue.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/msccl.cc:7: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:75:21: warning: unused function 'mscclXmlGetAttrInt' [-Wunused-function] 75 | static ncclResult_t mscclXmlGetAttrInt(struct mscclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:82:21: warning: unused function 'mscclXmlGetAttrInt64' [-Wunused-function] 82 | static ncclResult_t mscclXmlGetAttrInt64(struct mscclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:89:21: warning: unused function 'mscclXmlFindTag' [-Wunused-function] 89 | static ncclResult_t mscclXmlFindTag(struct mscclXml* xml, const char* tagName, struct mscclXmlNode** node) { | ^~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/msccl.cc:6: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/enqueue.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/msccl.cc:7: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:75:21: warning: unused function 'mscclXmlGetAttrInt' [-Wunused-function] 75 | static ncclResult_t mscclXmlGetAttrInt(struct mscclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:82:21: warning: unused function 'mscclXmlGetAttrInt64' [-Wunused-function] 82 | static ncclResult_t mscclXmlGetAttrInt64(struct mscclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:89:21: warning: unused function 'mscclXmlFindTag' [-Wunused-function] 89 | static ncclResult_t mscclXmlFindTag(struct mscclXml* xml, const char* tagName, struct mscclXmlNode** node) { | ^~~~~~~~~~~~~~~ 7 warnings generated when compiling for gfx1200. 7 warnings generated when compiling for gfx90a. 37 warnings generated when compiling for host. 7 warnings generated when compiling for gfx1201. 7 warnings generated when compiling for gfx90a. [ 45%] Building CXX object CMakeFiles/rccl.dir/hipify/src/register.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60300 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/src/register.cc.o -MF CMakeFiles/rccl.dir/hipify/src/register.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/register.cc.o -c /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/register.cc 7 warnings generated when compiling for gfx1101. 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx1100. [ 45%] Building CXX object CMakeFiles/rccl.dir/hipify/src/transport.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60300 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/src/transport.cc.o -MF CMakeFiles/rccl.dir/hipify/src/transport.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/transport.cc.o -c /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport.cc 55 warnings generated when compiling for host. [ 46%] Building CXX object CMakeFiles/rccl.dir/hipify/src/device/common.cu.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60300 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/src/device/common.cu.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/device/common.cu.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/device/common.cu.cpp.o -c /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.cu.cpp In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/proxy.cc:9: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/proxy.cc:9: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/proxy.cc:9: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/proxy.cc:9: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/proxy.cc:9: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/proxy.cc:9: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/proxy.cc:9: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/proxy.cc:9: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/proxy.cc:9: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/proxy.cc:9: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/proxy.cc:9: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/proxy.cc:9: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/proxy.cc:9: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/proxy.cc:9: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/proxy.cc:9: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/proxy.cc:9: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 2 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/register.cc:7: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/argcheck.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/register.cc:7: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/argcheck.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ 2 warnings generated when compiling for gfx90a. 2 warnings generated when compiling for gfx1201. 2 warnings generated when compiling for gfx1101. 2 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/register.cc:7: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/argcheck.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/register.cc:7: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/argcheck.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport.cc:8: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/register.cc:7: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/argcheck.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport.cc:8: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ 2 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/register.cc:7: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/argcheck.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport.cc:8: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/register.cc:7: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/argcheck.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport.cc:8: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport.cc:8: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ 2 warnings generated when compiling for gfx1200. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/register.cc:7: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/argcheck.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport.cc:8: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport.cc:8: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport.cc:8: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/register.cc:7: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/argcheck.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/register.cc:7: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/argcheck.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/register.cc:7: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/argcheck.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/register.cc:7: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/argcheck.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/register.cc:7: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/argcheck.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/register.cc:7: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/argcheck.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/register.cc:7: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/argcheck.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/register.cc:7: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/argcheck.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport.cc:8: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport.cc:8: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport.cc:8: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport.cc:8: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport.cc:8: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 2 warnings generated when compiling for gfx1101. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport.cc:8: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from 2 warning/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport.cc:8: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ s generated when compiling for host. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport.cc:8: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 2 warnings generated when compiling for gfx1201. 2 warnings generated when compiling for gfx1102. 2 warnings generated when compiling for gfx90a. 2 warnings generated when compiling for gfx1200. 2 warnings generated when compiling for gfx90a. 2 warnings generated when compiling for gfx1100. 2 warnings generated when compiling for gfx1200. 2 warnings generated when compiling for gfx90a. 2 warnings generated when compiling for gfx1201. 2 warnings generated when compiling for gfx90a. [ 46%] Building CXX object CMakeFiles/rccl.dir/hipify/src/device/onerank.cu.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60300 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/src/device/onerank.cu.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/device/onerank.cu.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/device/onerank.cu.cpp.o -c /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/onerank.cu.cpp 2 warnings generated when compiling for gfx1102. 2 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.cu.cpp:8: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ 2 warnings generated when compiling for gfx1101. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.cu.cpp:8: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.cu.cpp:8: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.cu.cpp:8: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.cu.cpp:8: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.cu.cpp:8: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.cu.cpp:8: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.cu.cpp:8: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ 2 warnings generated when compiling for host. [ 46%] Building CXX object CMakeFiles/rccl.dir/hipify/src/graph/connect.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60300 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/src/graph/connect.cc.o -MF CMakeFiles/rccl.dir/hipify/src/graph/connect.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/graph/connect.cc.o -c /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/connect.cc 2 warnings generated when compiling for host. [ 46%] Building CXX object CMakeFiles/rccl.dir/hipify/src/graph/paths.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60300 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/src/graph/paths.cc.o -MF CMakeFiles/rccl.dir/hipify/src/graph/paths.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/graph/paths.cc.o -c /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/paths.cc In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.cu.cpp:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.cu.cpp:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 2 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.cu.cpp:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.cu.cpp:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.cu.cpp:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.cu.cpp:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.cu.cpp:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.cu.cpp:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/onerank.cu.cpp:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common_kernel.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/onerank.cu.cpp:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common_kernel.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/onerank.cu.cpp:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common_kernel.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/onerank.cu.cpp:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common_kernel.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/onerank.cu.cpp:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common_kernel.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/onerank.cu.cpp:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common_kernel.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/onerank.cu.cpp:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common_kernel.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/onerank.cu.cpp:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common_kernel.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/connect.cc:8: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/connect.cc:8: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/connect.cc:8: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/connect.cc:8: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/connect.cc:8: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/connect.cc:8: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/connect.cc:8: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/connect.cc:8: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/paths.cc:9: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/paths.cc:9: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/paths.cc:9: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/paths.cc:9: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/paths.cc:9: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/paths.cc:9: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/paths.cc:9h: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:12e: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ ad, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/paths.cc:9: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/connect.cc:120:12: warning: unused variable 'y' [-Wunused-variable] 120 | int x=0, y=0; | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/connect.cc:127:7: warning: unused variable 'localRanks' [-Wunused-variable] 127 | int localRanks = comm->topo->nodes[GPU].count; | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/connect.cc:120:12: warning: unused variable 'y' [-Wunused-variable] 120 | int x=0, y=0; | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/connect.cc:127:7: warning: unused variable 'localRanks' [-Wunused-variable] 127 | int localRanks = comm->topo->nodes[GPU].count; | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/connect.cc:120:12: warning: unused variable 'y' [-Wunused-variable] 120 | int x=0, y=0; | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/connect.cc:127:7: warning: unused variable 'localRanks' [-Wunused-variable] 127 | int localRanks = comm->topo->nodes[GPU].count; | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/connect.cc:120:12: warning: unused variable 'y' [-Wunused-variable] 120 | int x=0, y=0; | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/connect.cc:127:7: warning: unused variable 'localRanks' [-Wunused-variable] 127 | int localRanks = comm->topo->nodes[GPU].count; | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/connect.cc/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/connect.cc:120:12: warning: unused variable 'y' [-Wunused-variable] 120 | int x=0, y=0; | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/connect.cc:127:7: warning: unused variable 'localRanks' [-Wunused-variable] 127 | int localRanks = comm->topo->nodes[GPU].count; | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/connect.cc:120:12: warning: unused variable 'y' [-Wunused-variable] 120 | int x=0, y=0; | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/connect.cc:127:7: warning: unused variable 'localRanks' [-Wunused-variable] 127 | int localRanks = comm->topo->nodes[GPU].count; | ^~~~~~~~~~ :120:12: warning: unused variable 'y' [-Wunused-variable] 120 | int x=0, y=0; | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/connect.cc:127:7: warning: unused variable 'localRanks' [-Wunused-variable] 127 | int localRanks = comm->topo->nodes[GPU].count; | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/connect.cc:120:12: warning: unused variable 'y' [-Wunused-variable] 120 | int x=0, y=0; | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/connect.cc:127:7: warning: unused variable 'localRanks' [-Wunused-variable] 127 | int localRanks = comm->topo->nodes[GPU].count; | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/paths.cc:272:7: warning: variable 'intermediateIndex' set but not used [-Wunused-but-set-variable] 272 | int intermediateIndex = -1; | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/paths.cc:272:7: warning: variable 'intermediateIndex' set but not used [-Wunused-but-set-variable] 272 | int intermediateIndex = -1; | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/paths.cc:272:7: warning: variable 'intermediateIndex' set but not used [-Wunused-but-set-variable] 272 | int intermediateIndex = -1; | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/paths.cc:272:7: warning: variable 'intermediateIndex' set but not used [-Wunused-but-set-variable] 272 | int intermediateIndex = -1; | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/paths.cc:272:7: warning: variable 'intermediateIndex' set but not used [-Wunused-but-set-variable] 272 | int intermediateIndex = -1; | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/paths.cc:459:24: warning: unused variable 'gpu' [-Wunused-variable] 459 | struct ncclTopoNode* gpu = system->nodes[GPU].nodes+g; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/paths.cc:459:24: warning: unused variable 'gpu' [-Wunused-variable] 459 | struct ncclTopoNode* gpu = system->nodes[GPU].nodes+g; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/paths.cc:459:24: warning: unused variable 'gpu' [-Wunused-variable] 459 | struct ncclTopoNode* gpu = system->nodes[GPU].nodes+g; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/paths.cc:272:7: warning: variable 'intermediateIndex' set but not used [-Wunused-but-set-variable] 272 | int intermediateIndex = -1; | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/paths.cc:459:24: warning: unused variable 'gpu' [-Wunused-variable] 459 | struct ncclTopoNode* gpu = system->nodes[GPU].nodes+g; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/connect.cc:12: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:211:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 211 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:222:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 222 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:233:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 233 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:245:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 245 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:258:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 258 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:268:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 268 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:279:13: warning: unused function 'isPow2' [-Wunused-function] 279 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:282:12: warning: unused function 'mirrorBits' [-Wunused-function] 282 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/connect.cc:261:21: warning: unused function 'getIndexes' [-Wunused-function] 261 | static ncclResult_t getIndexes(int* ranks, int* indexes, int nNodes) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/connect.cc:435:21: warning: unused function 'connectNvls' [-Wunused-function] 435 | static ncclResult_t connectNvls(struct ncclComm* comm, int* nvlsHeads, int nHeads) { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/paths.cc:459:24: warning: unused variable 'gpu' [-Wunused-variable] 459 | struct ncclTopoNode* gpu = system->nodes[GPU].nodes+g; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/paths.cc:272:7: warning: variable 'intermediateIndex' set but not used [-Wunused-but-set-variable] 272 | int intermediateIndex = -1; | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/paths.cc:272:7: warning: variable 'intermediateIndex' set but not used [-Wunused-but-set-variable] 272 | int intermediateIndex = -1; | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/paths.cc:459:24: warning: unused variable 'gpu' [-Wunused-variable] 459 | struct ncclTopoNode* gpu = system->nodes[GPU].nodes+g; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/paths.cc:459:24: warning: unused variable 'gpu' [-Wunused-variable] 459 | struct ncclTopoNode* gpu = system->nodes[GPU].nodes+g; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/paths.cc:459:24: warning: unused variable 'gpu' [-Wunused-variable] 459 | struct ncclTopoNode* gpu = system->nodes[GPU].nodes+g; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/connect.cc:12: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:211:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 211 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:222:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 222 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:233:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 233 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:245:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 245 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:258:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 258 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:268:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 268 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:279:13: warning: unused function 'isPow2' [-Wunused-function] 279 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:282:12: warning: unused function 'mirrorBits' [-Wunused-function] 282 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/connect.cc:261:21: warning: unused function 'getIndexes' [-Wunused-function] 261 | static ncclResult_t getIndexes(int* ranks, int* indexes, int nNodes) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/connect.cc:435:21: warning: unused function 'connectNvls' [-Wunused-function] 435 | static ncclResult_t connectNvls(struct ncclComm* comm, int* nvlsHeads, int nHeads) { | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/connect.cc:12: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:211:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 211 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:222:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 222 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:233:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 233 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:245:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 245 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:258:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 258 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:268:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 268 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:279:13: warning: unused function 'isPow2' [-Wunused-function] 279 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:282:12: warning: unused function 'mirrorBits' [-Wunused-function] 282 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/connect.cc:261:21: warning: unused function 'getIndexes' [-Wunused-function] 261 | static ncclResult_t getIndexes(int* ranks, int* indexes, int nNodes) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/connect.cc:435:21: warning: unused function 'connectNvls' [-Wunused-function] 435 | static ncclResult_t connectNvls(struct ncclComm* comm, int* nvlsHeads, int nHeads) { | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/onerank.cu.cpp:9: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n)In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/connect.cc:12: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:211:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 211 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:222:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 222 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:233:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 233 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:245:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 245 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:258:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 258 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:268:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 268 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:279:13: warning: unused function 'isPow2' [-Wunused-function] 279 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:282:12: warning: unused function 'mirrorBits' [-Wunused-function] 282 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/connect.cc:261:21: warning: unused function 'getIndexes' [-Wunused-function] 261 | static ncclResult_t getIndexes(int* ranks, int* indexes, int nNodes) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/connect.cc:435:21: warning: unused function 'connectNvls' [-Wunused-function] 435 | static ncclResult_t connectNvls(struct ncclComm* comm, int* nvlsHeads, int nHeads) { | ^~~~~~~~~~~ In file included from { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/connect.cc:12: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:211:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 211 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:222:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 222 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:233:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 233 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:245:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 245 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:258:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 258 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:268:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 268 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:279:13: warning: unused function 'isPow2' [-Wunused-function] 279 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:282:12: warning: unused function 'mirrorBits' [-Wunused-function] 282 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/connect.cc:261:21: warning: unused function 'getIndexes' [-Wunused-function] 261 | static ncclResult_t getIndexes(int* ranks, int* indexes, int nNodes) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/connect.cc:435:21: warning: unused function 'connectNvls' [-Wunused-function] 435 | static ncclResult_t connectNvls(struct ncclComm* comm, int* nvlsHeads, int nHeads) { | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/connect.cc:12: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:211:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 211 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:222:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 222 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:233:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 233 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:245:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 245 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:258:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 258 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:268:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 268 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:279:13: warning: unused function 'isPow2' [-Wunused-function] 279 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:282:12: warning: unused function 'mirrorBits' [-Wunused-function] 282 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/connect.cc:261:21: warning: unused function 'getIndexes' [-Wunused-function] 261 | static ncclResult_t getIndexes(int* ranks, int* indexes, int nNodes) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/connect.cc:435:21: warning: unused function 'connectNvls' [-Wunused-function] 435 | static ncclResult_t connectNvls(struct ncclComm* comm, int* nvlsHeads, int nHeads) { | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/connect.cc:12: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:211:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 211 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:222:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 222 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:233:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 233 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:245:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 245 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:258:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 258 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:268:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 268 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:279:13: warning: unused function 'isPow2' [-Wunused-function] 279 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:282:12: warning: unused function 'mirrorBits' [-Wunused-function] 282 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/connect.cc:261:21: warning: unused function 'getIndexes' [-Wunused-function] 261 | static ncclResult_t getIndexes(int* ranks, int* indexes, int nNodes) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/connect.cc:435:21: warning: unused function 'connectNvls' [-Wunused-function] 435 | static ncclResult_t connectNvls(struct ncclComm* comm, int* nvlsHeads, int nHeads) { | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/connect.cc:12: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:211:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 211 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:222:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 222 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:233:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 233 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:245:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 245 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:258:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 258 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:268:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 268 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:279:13: warning: unused function 'isPow2' [-Wunused-function] 279 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:282:12: warning: unused function 'mirrorBits' [-Wunused-function] 282 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/connect.cc:261:21: warning: unused function 'getIndexes' [-Wunused-function] 261 | static ncclResult_t getIndexes(int* ranks, int* indexes, int nNodes) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/connect.cc:435:21: warning: unused function 'connectNvls' [-Wunused-function] 435 | static ncclResult_t connectNvls(struct ncclComm* comm, int* nvlsHeads, int nHeads) { | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/paths.cc:8: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/paths.cc:10: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:233:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 233 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:245:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 245 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:268:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 268 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:279:13: warning: unused function 'isPow2' [-Wunused-function] 279 | static bool isPow2(int val) { | ^~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/paths.cc:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/paths.cc:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:75:21: warning: unused function 'xmlAlloc' [-Wunused-function] 75 | static ncclResult_t xmlAlloc(struct ncclXml** xml, int maxNodes) { | ^~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:110:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 110 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:117:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 117 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrNamIn file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/onerank.cu.cpp:9: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ e, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:124:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 124 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:132:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 132 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:139:21: warning: unused function 'xmlFindTag' [-Wunused-function] 139 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:151:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 151 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:163:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 163 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:179:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 179 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:192:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 192 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:204:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 204 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:217:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 217 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:230:21: warning: unused function 'xmlSetAttrLong' [-Wunused-function] 230 | static ncclResult_t xmlSetAttrLong(struct ncclXmlNode* node, const char* attrName, const int64_t value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:243:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 243 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:255:21: warning: unused function 'xmlGetSub' [-Wunused-function] 255 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:281:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 281 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:288:21: warning: unused function 'xmlAddNode' [-Wunused-function] 288 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:310:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 310 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:323:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 323 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:353:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 353 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:366:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 366 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/onerank.cu.cpp:9: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/paths.cc:8: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/paths.cc:10: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:233:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 233 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:245:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 245 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:268:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 268 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:279:13: warning: unused function 'isPow2' [-Wunused-function] 279 | static bool isPow2(int val) { | ^~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/paths.cc:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/paths.cc:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:75:21: warning: unused function 'xmlAlloc' [-Wunused-function] 75 | static ncclResult_t xmlAlloc(struct ncclXml** xml, int maxNodes) { | ^~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:110:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 110 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:117:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 117 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:124:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 124 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:132:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 132 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:139:21: warning: unused function 'xmlFindTag' [-Wunused-function] 139 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:151:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 151 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:163:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 163 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:179:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 179 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:192:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 192 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:204:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 204 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:217:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 217 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:230:21: warning: unused function 'xmlSetAttrLong' [-Wunused-function] 230 | static ncclResult_t xmlSetAttrLong(struct ncclXmlNode* node, const char* attrName, const int64_t value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:243:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 243 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:255:21: warning: unused function 'xmlGetSub' [-Wunused-function] 255 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:281:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 281 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:288:21: warning: unused function 'xmlAddNode' [-Wunused-function] 288 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:310:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 310 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:323:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 323 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:353:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 353 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:366:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 366 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/paths.cc:8: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/paths.cc:10: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:233:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 233 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:245:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 245 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:268:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 268 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:279:13: warning: unused function 'isPow2' [-Wunused-function] 279 | static bool isPow2(int val) { | ^~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/paths.cc:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/paths.cc:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:75:21: warning: unused function 'xmlAlloc' [-Wunused-function] 75 | static ncclResult_t xmlAlloc(struct ncclXml** xml, int maxNodes) { | ^~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:110:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 110 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:117:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 117 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:124:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 124 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:132:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 132 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:139:21: warning: unused function 'xmlFindTag' [-Wunused-function] 139 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:151:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 151 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:163:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 163 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:179:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 179 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:192:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 192 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:204:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 204 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:217:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 217 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:230:21: warning: unused function 'xmlSetAttrLong' [-Wunused-function] 230 | static ncclResult_t xmlSetAttrLong(struct ncclXmlNode* node, const char* attrName, const int64_t value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:243:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 243 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:255:21: warning: unused function 'xmlGetSub' [-Wunused-function] 255 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const chIn file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/paths.cca:8: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/paths.cc:10: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:233:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 233 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:245:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 245 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:268:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 268 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:279:13: warning: unused function 'isPow2' [-Wunused-function] 279 | static bool isPow2(int val) { | ^~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/paths.cc:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/paths.cc:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:75:21: warning: unused function 'xmlAlloc' [-Wunused-function] 75 | static ncclResult_t xmlAlloc(struct ncclXml** xml, int maxNodes) { | ^~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:110:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 110 | static ncclResult_t xmlGetr* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:281:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 281 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:288:21: warning: unused function 'xmlAddNode' [-Wunused-function] 288 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:310:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 310 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:323:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 323 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:353:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 353 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:366:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 366 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/paths.cc:8: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/paths.cc:10: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:233:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 233 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:245:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 245 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:268:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 268 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:279:13: warning: unused function 'isPow2' [-Wunused-function] 279 | static bool isPow2(int val) { | ^~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/paths.cc:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/paths.cc:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:75:21: warning: unused function 'xmlAlloc' [-Wunused-function] 75 | static ncclResult_t xmlAlloc(struct ncclXml** xml, int maxNodes) { | ^~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:110:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 110 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:117:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 117 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:124:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 124 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:132:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 132 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:139:21: warning: unused function 'xmlFindTag' [-Wunused-function] 139 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:151:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 151 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:163:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 163 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:179:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 179In file included from | /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/paths.cc:8: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.hs:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:t46:13: warning: unused function 'log2i' [-Wunused-function] a46 | static lotng log2i(ilong n)c { AttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:117:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] | 117 ^~~~~ | static ncclResult_t xmlGetAttrIntDefault(strucIn file included from t /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/paths.ccn:c10c: lX/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.hm:l233N:o21d:e *warning: nunused function 'ncclTopoDevToRank' [-Wunused-function]o de, const char* attrName, 233i | nstt*a tviacl unec,c liRnets udletf_at unlctcVlaTloupeo)D e{v T o| R ^~~~~~~~~~~~~~~~~~~~a nk(struct /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.hn:c124c:l21T:o pwarning: ounused function 'xmlGetAttrLong' [-Wunused-function]S ystem* system, in t124 | dsetva,t iicn tn* crcalnRke) s{u l t| _ ^~~~~~~~~~~~~~~~~t xmlGetAtt/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.hr:L245o:n21g:( swarning: tunused function 'ncclTopoIdToNetDev' [-Wunused-function]r uct ncclXmlNod e245* | sntoadtei,c cnocncsltR ecshualrt*_ ta tntcrcNlaTmoep,o IidnTto6N4e_ttD*e vv(asltureu)c t{ n c| c ^~~~~~~~~~~~~~ lTopoSy/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.hs:t132e:m21*: warning: sunused function 'xmlGetAttrFloat' [-Wunused-function]y stem, int64_t 132i | ds,t aitnitc* nncectlDReevs)u l{t _ t| ^~~~~~~~~~~~~~~~~~ xmlGetAt/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.ht:r268F:l14o:a twarning: (unused function 'ncclTopoNVLinkBw' [-Wunused-function]s truct ncclX m268l | Nsotdaet*i cn ofdleo,a tc onncsctl TcohpaorN* VaLtitnrkNBawm(ei,n tf lcouadta*C ovmaplCuaep)) {{ | | ^~~~~~~~~~~~~~~ ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h139::27921::13 :warning: unused function 'xmlFindTag' [-Wunused-function]warning: unused function 'isPow2' [-Wunused-function] 279 | stati c139 | bsotoalt iic snPcocwl2R(eisnutl tv_atl )x m{l F i| n ^~~~~~ dTag(struct ncIn file included from c/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/paths.ccl:X13m: l/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/channel.h*: 41x:m21l:, warning: cunused function 'ncclChannelCompute' [-Wunused-function]o nst char* tagName, str u41c | ts tnactcilcX mnlcNocdleR**e snuoldte_)t {n c c| l ^~~~~~~~~~C hannel/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.hC:o151m:p21u: twarning: eunused function 'xmlFindNextTag' [-Wunused-function]( struct ncclComm* c o151m | ms,t aitnitc pnececrl,R eisnutl tc_hta xnmnleFliInndcN,e ixnttT acgo(lslt,r uicntt *ncchcalnXnmell*I dx)m l{, c| ^~~~~~~~~~~~~~~~~~o nst char* In file included from t/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/paths.cca:g14N: a/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.hm:e75,: 21s:t rwarning: uunused function 'xmlAlloc' [-Wunused-function]c t ncclXmlNo d75e | *s tparteivc, nsctcrluRcets unlctc_ltX mxlmNloAdel*l*o cn(osdte)r u{c t | ^~~~~~~~~~~~~~n cclX/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.hm:l163*:*21 :x mwarning: lunused function 'xmlFindTagKv' [-Wunused-function], int maxNodes) { | ^~~~~~~~ 163 | stati/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.hc: 110n:c21c:l Rwarning: eunused function 'xmlGetAttrInt' [-Wunused-function]s ult_t xmlFindTa g110K | vs(tstartuicct nnccccllRXemslu*l tx_mtl ,x mcloGnesttA tcthraIrn*t (tsatgrNuacmte ,n csctlrXumcltN ondcec*lX mnloNdoed,e *c*o nnsotd ec,h acro*n satt tcrhNaarm*e ,a titnrtN*a mvea,l uceo)n s{t | c ^~~~~~~~~~~~~h ar* at/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.ht:r117V:a21l:u ewarning: )unused function 'xmlGetAttrIntDefault' [-Wunused-function] { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:179:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 117 | static nccl R179e | ssutlatt_itc xnmclcGleRteAstutlrtI_ntt DxemflaSuelttA(tsttrr(usctrtu cntc cnlcXcmllXNmoldNeo*d en*o dne,o dceo,ns tc ocnhsatr *c haatrt*r Naatmter,N aimnet,* cvoanlsute ,c hinatr *d evfaaluulet)V a{l u e| ) ^~~~~~~~~~ { | ^~~~~~~~~~~~~~~~~~~~/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h :192:21: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.hwarning: :unused function 'xmlSetAttrIfUnset' [-Wunused-function]124 :21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 192 | s124t | asttiact incc cnlcRcelsRueslutl_tt_ t xxmmllSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:204:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 204 | static ncclResult ncclResult_t xmlSetAttr(struct GetnAttrLong(cstruct cncclXmllNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:132:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 132 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:139:21: warning: unused function 'xmlFindTag' [-Wunused-function] 139 | static ncclReXmlNodsult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:151:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 151 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:163:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 163 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:179:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 179 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:192:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 192 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:204:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 204 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:217:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 217 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:230:21: warning: unused function 'xmlSetAttrLong' [-Wunused-function] 230 | static ncclResult_t xmlSetAttrLong(struct ncclXmlNode* node, const char* attrName, const int64_t value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:243:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 243 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:255:21: warning: unused function 'xmlGetSub' [-Wunused-function] 255 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:281:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 281 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:288:21: warning: unused function 'xmlAddNode' [-Wunused-function] 288 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:310:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 310 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:323:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 323 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:353:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 353 | static ncclResult_t kvConvertToInt(const char* str, int* vale* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:192:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 192 | stIn file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/paths.cc:8: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/paths.cc:10: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:233:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 233 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:245:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 245 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:268:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 268 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:279:13: warning: unused function 'isPow2' [-Wunused-function] 279 | static bool isPow2(int val) { | ^~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/paths.cc:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/paths.cc:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:75:21: warning: unused function 'xmlAlloc' [-Wunused-function] 75 | static ncclResult_t xmlAlloc(struct ncclXml** xml, int maxNodes) { | ^~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:110:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 110 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:117:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 117 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:124:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 124 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:132:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 132 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:139:21: warning: unused function 'xmlFindTag' [-Wunused-function] 139 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:151:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 151 | satic ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:204:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 204 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:217:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 217 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:230:21: warning: unused function 'xmlSetAttrLong' [-Wunused-function] 230 | static ncclResult_t xmlSetAttrLong(struct ncclXmlNode* node, const char* attrName, const int64_t value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:243:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 243 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:255:21: warning: unused function 'xmlGetSub' [-Wunused-function] 255 | statue, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:366:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 366 | static ncc_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:217:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] lResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ 217 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:230:21: warning: unused function 'xmlSetAttrLong' [-Wunused-function] 230 | static ncclResult_t xmlSetAttrLong(struct ncclXmlNode* node, const char* attrName, const int64_t value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:243:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 243 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:255:21: warning: unused function 'xmlGetSub' [-Wunused-function] 255 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:281:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 281 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:288:21: warning: unused function 'xmlAddNode' [-Wunused-function] 288 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:310:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 310 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:323:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 323 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:353:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 353 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:366:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 366 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ tatic ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:163:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 163 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:179:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 179 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:192:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 192 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:204:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 204 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:217:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 217 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:230:21: warning: unused function 'xmlSetAttrLong' [-Wunused-function] 230 | static ncclResult_t xmlSetAttrLong(struct ncclXmlNode* node, const char* attrName, const int64_t value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:243:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 243 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:255:21: warning: unused function 'xmlGetSub' [-Wunused-function] 255 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, strucic ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:281:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 281 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:288:21: warning: unused function 'xmlAddNode' [-Wunused-function] 288 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:310:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 310 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:323:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 323 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:353:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 353 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:366:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 366 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ t ncclXmlNode** sub) { | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:281:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 281 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:288:21: warning: unused function 'xmlAddNode' [-Wunused-function] 288 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:310:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 310 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:323:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 323 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:353:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 353 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:366:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 366 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/onerank.cu.cpp:9: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/paths.cc:8: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/paths.cc:10: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:233:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 233 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:245:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 245 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:268:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 268 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:279:13: warning: unused function 'isPow2' [-Wunused-function] 279 | static bool isPow2(int val) { | ^~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/paths.cc:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/paths.cc:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:75:21: warning: unused function 'xmlAlloc' [-Wunused-function] 75 | static ncclResult_t xmlAlloc(struct ncclXml** xml, int maxNodes) { | ^~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:110:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 110 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:117:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 117 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:124:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 124 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:132:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 132 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:139:21: warning: unused function 'xmlFindTag' [-Wunused-function] 139 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:151:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 151 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:163:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 163 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:179:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 179 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:192:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 192 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:204:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 204 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:217:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 217 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:230:21: warning: unused function 'xmlSetAttrLong' [-Wunused-function] 230 | static ncclResult_t xmlSetAttrLong(struct ncclXmlNode* node, const char* attrName, const int64_t value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:243:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 243 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:255:21: warning: unused function 'xmlGetSub' [-Wunused-function] 255 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:281:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 281 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:288:21: warning: unused function 'xmlAddNode' [-Wunused-function] 288 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:310:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 310 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:323:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 323 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:353:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 353 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:366:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 366 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/onerank.cu.cpp:9: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/onerank.cu.cpp:9: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/onerank.cu.cpp:9: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 2 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/onerank.cu.cpp:9: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 13 warnings generated when compiling for gfx1101. 1313 warnings generated when compiling for gfx90a. warnings generated when compiling for gfx90a. 13 warnings generated when compiling for gfx1100. 13 warnings generated when compiling for gfx1200. 13 warnings generated when compiling for gfx1102. 13 warnings generated when compiling for gfx1201. 30 warnings generated when compiling for gfx1102. 30 warnings generated when compiling for gfx1200. 30 warnings generated when compiling for gfx1100. 30 warnings generated when compiling for gfx1101. 30 warnings generated when compiling for gfx1201. 30 warnings generated when compiling for gfx90a. 30 warnings generated when compiling for gfx90a. 13 warnings generated when compiling for host. [ 47%] Building CXX object CMakeFiles/rccl.dir/hipify/src/graph/rings.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60300 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/src/graph/rings.cc.o -MF CMakeFiles/rccl.dir/hipify/src/graph/rings.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/graph/rings.cc.o -c /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rings.cc 30 warnings generated when compiling for host. [ 47%] Building CXX object CMakeFiles/rccl.dir/hipify/src/graph/rome_models.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60300 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/src/graph/rome_models.cc.o -MF CMakeFiles/rccl.dir/hipify/src/graph/rome_models.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/graph/rome_models.cc.o -c /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rings.cc:7: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rings.cc:7: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rings.cc:7: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rings.cc:7: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rings.cc:7: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rings.cc:7: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for host. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rings.cc:7: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rings.cc:7: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx1101. 1 warning generated when compiling for gfx1102. 1 warning generated when compiling for gfx1200. 1 warning generated when compiling for gfx1201. 1 warning generated when compiling for gfx1100. 2 warnings generated when compiling for gfx1100. 2 warnings generated when compiling for gfx1201. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:23: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ 1 warning generated when compiling for gfx90a. 2 warnings generated when compiling for gfx1101. 1 warning generated when compiling for gfx90a. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:23: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:23: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:23: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:23: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ 2 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:23: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:23: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc13:1044:7: warning: unused variable 'nChannels' [-Wunused-variable] 1044 | int nChannels = 0; | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1054:12: warning: unused variable 'y' [-Wunused-variable] 1054 | int x=0, y=0; | ^ : /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:23: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ [ 47%] Building CXX object CMakeFiles/rccl.dir/hipify/src/graph/search.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60300 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/src/graph/search.cc.o -MF CMakeFiles/rccl.dir/hipify/src/graph/search.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/graph/search.cc.o -c /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/search.cc /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1563:14: warning: variable length arrays in C++ are a Clang extension [-Wvla-cxx-extension] 1563 | int j, r[ngpus], g[ngpus]; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1563:14: note: read of non-const variable 'ngpus' is not allowed in a constant expression /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1538:7: note: declared here 1538 | int ngpus = system->nodes[GPU].count; | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1563:24: warning: variable length arrays in C++ are a Clang extension [-Wvla-cxx-extension] 1563 | int j, r[ngpus], g[ngpus]; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1563:24: note: read of non-const variable 'ngpus' is not allowed in a constant expression /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1538:7: note: declared here 1538 | int ngpus = system->nodes[GPU].count; | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1535:15: warning: unused variable 'ringRemap' [-Wunused-variable] 1535 | static char ringRemap[256]; | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1554:7: warning: variable 'gcnt' set but not used [-Wunused-but-set-variable] 1554 | int gcnt = 0; | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1625:9: warning: unused variable 't' [-Wunused-variable] 1625 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1044:7: warning: unused variable 'nChannels' [-Wunused-variable] 1044 | int nChannels = 0; | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1054:12: warning: unused variable 'y' [-Wunused-variable] 1054 | int x=0, y=0; | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1712:15: warning: unused variable 'ringRemap' [-Wunused-variable] 1712 | static char ringRemap[64]; | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1716:7: warning: unused variable 'ncpus' [-Wunused-variable] 1716 | int ncpus = system->nodes[CPU].count; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1809:9: warning: unused variable 't' [-Wunused-variable] 1809 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1044:7: warning: unused variable 'nChannels' [-Wunused-variable] 1044 | int nChannels = 0; | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1054:12: warning: unused variable 'y' [-Wunused-variable] 1054 | int x=0, y=0; | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1892:11: warning: 'NUMA_CPUS' macro redefined [-Wmacro-redefined] 1892 | #define NUMA_CPUS 4 | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1530:11: note: previous definition is here 1530 | #define NUMA_CPUS 2 | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1895:11: warning: 'TOTAL_PERMUTE_COUNT' macro redefined [-Wmacro-redefined] 1895 | #define TOTAL_PERMUTE_COUNT (NUMA_PERMUTE_COUNT*NUMA_PERMUTE_COUNT*NUMA_PERMUTE_COUNT*NUMA_PERMUTE_COUNT) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1533:11: note: previous definition is here 1533 | #define TOTAL_PERMUTE_COUNT (NUMA_PERMUTE_COUNT*NUMA_PERMUTE_COUNT) | ^ 2 warnings generated when compiling for gfx90a. /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1927:14: warning: variable length arrays in C++ are a Clang extension [-Wvla-cxx-extension] 1927 | int j, r[ngpus], g[ngpus]; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1927:14: note: read of non-const variable 'ngpus' is not allowed in a constant expression /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1900:7: note: declared here 1900 | int ngpus = system->nodes[GPU].count; | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1927:24: warning: variable length arrays in C++ are a Clang extension [-Wvla-cxx-extension] 1927 | int j, r[ngpus], g[ngpus]; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1927:24: note: read of non-const variable 'ngpus' is not allowed in a constant expression /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1900:7: note: declared here 1900 | int ngpus = system->nodes[GPU].count; | ^ :1563:14: warning: variable length arrays in C++ are a Clang extension [-Wvla-cxx-extension] 1563 | int j, r[ngpus], g[ngpus]; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1563:14: note: read of non-const variable 'ngpus' is not allowed in a constant expression /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1538:7: note: declared here 1538 | int ngpus = system->nodes[GPU].count; | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1563:24: warning: variable length arrays in C++ are a Clang extension [-Wvla-cxx-extension] 1563 | int j, r[ngpus], g[ngpus]; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1563:24: note: read of non-const variable 'ngpus' is not allowed in a constant expression /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1538:7: note: declared here 1538 | int ngpus = system->nodes[GPU].count; | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1535:15: warning: unused variable 'ringRemap' [-Wunused-variable] 1535 | static char ringRemap[256]; | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1554:7: warning: variable 'gcnt' set but not used [-Wunused-but-set-variable] 1554 | int gcnt = 0; | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1625:9: warning: unused variable 't' [-Wunused-variable] 1625 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - t/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1897:15: warning: unused variable 'ringRemap' [-Wunused-variable] 1897 | static char ringRemap[256]; | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1918:7: warning: variable 'gcnt' set but not used [-Wunused-but-set-variable] 1918 | int gcnt = 0; | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1993:9: warning: unused variable 't' [-Wunused-variable] 1993 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ vs.tv_usec)/1E3; | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1712:15: warning: unused variable 'ringRemap' [-Wunused-variable] 1712 | static char ringRemap[64]; | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1716:7: warning: unused variable 'ncpus' [-Wunused-variable] 1716 | int ncpus = system->nodes[CPU].count; | 2 warnings generated when compiling for gfx1200. ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1809:9: warning: unused variable 't' [-Wunused-variable] 1809 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1892:11: warning: 'NUMA_CPUS' macro redefined [-Wmacro-redefined] 1892 | #define NUMA_CPUS 4 | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1530:11: note: previous definition is here 1530 | #define NUMA_CPUS 2 | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1044:7: warning: unused variable 'nChannels' [-Wunused-variable] 1044 | int nChannels = 0; | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1054:12: warning: unused variable 'y' [-Wunused-variable] 1054 | int x=0, y=0; | ^ :1895:11: warning: 'TOTAL_PERMUTE_COUNT' macro redefined [-Wmacro-redefined] 1895 | #define TOTAL_PERMUTE_COUNT (NUMA_PERMUTE_COUNT*NUMA_PERMUTE_COUNT*NUMA_PERMUTE_COUNT*NUMA_PERMUTE_COUNT) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1533:11: note: previous definition is here 1533 | #define TOTAL_P:1563:14: warning: variable length arrays in C++ are a Clang extension [-Wvla-cxx-extension] 1563 | int j, r[ngpus], g[ngpus]; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1563:14: note: read of non-const variable 'ngpus' is not allowed in a constant expression /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1538:7: note: declared here 1538 | int ngpus = system->nodes[GPU].count; | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1563:24: warning: variable length arrays in C++ are a Clang extension [-Wvla-cxx-extension] E R1563M | U T E _ CiOnUtN Tj ,( NrU[MnAg_pPuEsR]M,U TgE[_nCgOpUuNsT]*;N U M| A ^~~~~_ PER/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.ccM:U1563T:E24_:C Onote: Uread of non-const variable 'ngpus' is not allowed in a constant expressionN T/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc): 1538 :| 7 ^: note: declared here 1538 | int ngpus = system->nodes[GPU].count; | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:2051:15: warning: variable length arrays in C++ are a Clang extension [-Wvla-cxx-extension] 2051 | int g_hives[ngpus], n_hives[nnets]; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:2051:15: note: read of non-const variable 'ngpus' is not allowed in a constant expression /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:2034:7: note: declared here 2034 | int ngpus = system->nodes[GPU].count; | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:2051:31: warning: variable length arrays in C++ are a Clang extension [-Wvla-cxx-extension] 2051 | int g_hives[ngpus], n_hives[nnets]; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:2051:31: note: read of non-const variable 'nnets' is not allowed in a constant expression /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:2035:7: note: declared here 2035 | int nnets = system->nodes[NET].count; | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1927:14: warning: variable length arrays in C++ are a Clang extension [-Wvla-cxx-extension] 1927 | int j, r[ngpus], g[ngpus]; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1927:14: note: read of non-const variable 'ngpus' is not allowed in a constant expression /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1900:7: note: declared here 1900 | int ngpus = system->nodes[GPU].count; | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1927:24: warning: variable length arrays in C++ are a Clang extension [-Wvla-cxx-extension] 1927 | int j, r[ngpus], g[ngpus]; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1927:24: note: read of non-const variable 'ngpus' is not allowed in a constant expression /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1900:7: note: declared here 1900 | int ngpus = system->nodes[GPU].count; | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1897:15: warning: unused variable 'ringRemap' [-Wunused-variable] 1897 | static char ringRemap[256]; | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1918:7: warning: variable 'gcnt' set but not used [-Wunused-but-set-variable] 1918 | int gcnt = 0; | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1993:9: warning: unused variable 't' [-Wunused-variable] 1993 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:2032:15: warning: unused variable 'ringRemap' [-Wunused-variable] 2032 | static char ringRemap[256]; | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:2051:15: warning: variable length arrays in C++ are a Clang extension [-Wvla-cxx-extension] 2051 | int g_hives[ngpus], n_hives[nnets]; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:2051:15: note: read of non-const variable 'ngpus' is not allowed in a constant expression /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:2034:7: note: declared here 2034 | int ngpus = system->nodes[GPU].count; | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:2051:31: warning: variable length arrays in C++ are a Clang extension [-Wvla-cxx-extension] 2051 | int g_hives[ngpus], n_hives[nnets]; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:2051:31: note: read of non-const variable 'nnets' is not allowed in a constant expression /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:2035:7: note: declared here 2035 | int nnets = system->nodes[NET].count; | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:2032:15: warning: unused variable 'ringRemap' [-Wunused-variable] 2032 | static char ringRemap[256]; | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1044:7: warning: unused variable 'nChannels' [-Wunused-variable] 1044 | int nChannels = 0; | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1054:12: warning: unused variable 'y' [-Wunused-variable] 1054 | int x=0, y=0; | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1535:15: warning: unused variable 'ringRemap' [-Wunused-variable] 1535 | static char ringRemap[256]; | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1554:7: warning: variable 'gcnt' set but not used [-Wunused-but-set-variable] 1554 | int gcnt = 0; | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1625:9: warning: unused variable 't' [-Wunused-variable] 1625 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1044:7: warning: unused variable 'nChannels' [-Wunused-variable] 1044 | int nChannels = 0; | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1054:12: warning: unused variable 'y' [-Wunused-variable] 1054 | int x=0, y=0; | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1563:14: warning: variable length arrays in C++ are a Clang extension [-Wvla-cxx-extension] 1563 | int j, r[ngpus], g[ngpus]; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1563:14: note: read of non-const variable 'ngpus' is not allowed in a constant expression /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1538:7: note: declared here 1538 | int ngpus = system->nodes[GPU].count; | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1563:24: warning: variable length arrays in C++ are a Clang extension [-Wvla-cxx-extension] 1563 | int j, r[ngpus], g[ngpus]; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1563:24: note: read of non-const variable 'ngpus' is not allowed in a constant expression /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1538:7: note: declared here 1538 | int ngpus = system->nodes[GPU].count; | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1712:15: warning: unused variable 'ringRemap' [-Wunused-variable] 1712 | static char ringRemap[64]; | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1716:7: warning: unused variable 'ncpus' [-Wunused-variable] 1716 | int ncpus = system->nodes[CPU].count; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1809:9: warning: unused variable 't' [-Wunused-variable] 1809 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1892:11: warning: 'NUMA_CPUS' macro redefined [-Wmacro-redefined] 1892 | #define NUMA_CPUS 4 | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1530:11: note: previous definition is here 1530 | #define NUMA_CPUS 2 | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1895:11: warning: 'TOTAL_PERMUTE_COUNT' macro redefined [-Wmacro-redefined] 1895 | #define TOTAL_PERMUTE_COUNT (NUMA_PERMUTE_COUNT*NUMA_PERMUTE_COUNT*NUMA_PERMUTE_COUNT*NUMA_PERMUTE_COUNT) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1533:11: note: previous definition is here 1533 | #define TOTAL_PERMUTE_COUNT (NUMA_PERMUTE_COUNT*NUMA_PERMUTE_COUNT) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1927:14: warning: variable length arrays in C++ are a Clang extension [-Wvla-cxx-extension] 1927 | int j, r[ngpus], g[ngpus]; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1927:14: note: read of non-const variable 'ngpus' is not allowed in a constant expression /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1900:7: note: declared here 1900 | int ngpus = system->nodes[GPU].count; | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1927:24: warning: variable length arrays in C++ are a Clang extension [-Wvla-cxx-extension] 1927 | int j, r[ngpus], g[ngpus]; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1927:24: note: read of non-const variable 'ngpus' is not allowed in a constant expression /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1900:7: note: declared here 1900 | int ngpus = system->nodes[GPU].count; | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1535:15: warning: unused variable 'ringRemap' [-Wunused-variable] 1535 | static char ringRemap[256]; | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1554:7: warning: variable 'gcnt' set but not used [-Wunused-but-set-variable] 1554 | int gcnt = 0; | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1625:9: warning: unused variable 't' [-Wunused-variable] 1625 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1044:7: warning: unused variable 'nChannels' [-Wunused-variable] 1044 | int nChannels = 0; | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1054:12: warning: unused variable 'y' [-Wunused-variable] 1054 | int x=0, y=0; | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1044:7: warning: unused variable 'nChannels' [-Wunused-variable] 1044 | int nChannels = 0; :1897:15: warning: unused variable 'ringRemap' [-Wunused-variable] 1897 | static char ringRemap[256]; | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1918:7: warning: variable 'gcnt' set but not used [-Wunused-but-set-variable] 1918 | int gcnt = 0; | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1993:9: warning: unused variable 't' [-Wunused-variable] 1993 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1054:12: warning: unused variable 'y' [-Wunused-variable] 1054 | int x=0, y=0; | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:2051:15: warning: variable length arrays in C++ are a Clang extension [-Wvla-cxx-extension] 2051 | int g_/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1563:14: warning: variable length arrays in C++ are a Clang extension [-Wvla-cxx-extension] 1563 | int j, r[ngpus], g[ngpus]; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1563:14: note: read of non-const variable 'ngpus' is not allowed in a constant expression /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1538:7: note: declared here 1538 | int ngpus = system->nodes[GPU].count; | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1563:24: warning: variable length arrays in C++ are a Clang extension [-Wvla-cxx-extension] 1563 | int j, r[ngpus], g[ngpus]; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1563:24: note: read of non-const variable 'ngpus' is not allowed in a constant expression /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1538:7: note: declared here 1538 | int ngpus = system->nodes[GPU].count; | ^ hives[ngpus], n_hives[nnets]; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:2051:15: note: read of non-const variable 'ngpus' is not allowed in a constant expression /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:2034:7: note: declared here 2034 | int ngpus = system->nodes[GPU].count; | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:2051:31: warning: variable length arrays in C++ are a Clang extension [-Wvla-cxx-extension] 2051 | int g_hives[ngpus], n_hives[nnets]; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:2051:31: note: read of non-const variable 'nnets' is not allowed in a constant expression /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:2035:7: note: declared here 2035 | int nnets = system->nodes[NET].count; | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:2032:15: warning: unused variable 'ringRemap' [-Wunused-variable] /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1712:15: warning: unused variable 'ringRemap' [-Wunused-variable] 1712 | static char ringRemap[64]; | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1716:7: warning: unused variable 'ncpus' [-Wunused-variable] 1716 | int ncpus = system->nodes[CPU].count; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1809:9: warning: unused variable 't' [-Wunused-variable] 1809 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; 2032 | static char ringRemap[256]; | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1563:14: warning: variable length arrays in C++ are a Clang extension [-Wvla-cxx-extension] 1563 | int j, r[ngpus], g[ngpus]; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1563:14: note: read of non-const variable 'ngpus' is not allowed in a constant expression /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1538:7: note: declared here 1538 | int ngpus = system->nodes[GPU].count; | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1563:24: warning: variable length arrays in C++ are a Clang extension [-Wvla-cxx-extension] 1563 | int j, r[ngpus], g[ngpus]; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1563:24: note: read of non-const variable 'ngpus' is not allowed in a constant expression /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1538:7: note: declared here 1538 | int ngpus = system->nodes[GPU].count; | ^ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1892:11: warning: 'NUMA_CPUS' macro redefined [-Wmacro-redefined] 1892 | #define NUMA_CPUS 4 | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1530:11: note: previous definition is here 1530 | #define NUMA_CPUS 2 | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1895:11: warning: 'TOTAL_PERMUTE_COUNT' macro redefined [-Wmacro-redefined] 1895 | #define TOTAL_PERMUTE_COUNT (NUMA_PERMUTE_COUNT*NUMA_PERMUTE_COUNT*NUMA_PERMUTE_COUNT*NUMA_PERMUTE_COUNT) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1533:11: note: previous definition is here 1533 | #define TOTAL_PERMUTE_COUNT (NUMA_PERMUTE_COUNT*NUMA_PERMUTE_COUNT) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1927:14: warning: variable length arrays in C++ are a Clang extension [-Wvla-cxx-extension] 1927 | int j, r[ngpus], g[ngpus]; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1927:14: note: read of non-const variable 'ngpus' is not allowed in a constant expression /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1900:7: note: declared here 1900 | int ngpus = system->nodes[GPU].count; | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1927:24: warning: variable length arrays in C++ are a Clang extension [-Wvla-cxx-extension] 1927 | int j, r[ngpus], g[ngpus]; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1927:24: note: read of non-const variable 'ngpus' is not allowed in a constant expression /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1900:7: note: declared here 1900 | int ngpus = system->nodes[GPU].count; | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1897:15: warning: unused variable 'ringRemap' [-Wunused-variable] 1897 | static char ringRemap[256]; | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1918:7: warning: variable 'gcnt' set but not used [-Wunused-but-set-variable] 1918 | int gcnt = 0; | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1993:9: warning: unused variable 't' [-Wunused-variable] 1993 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1535:15: warning: unused variable 'ringRemap' [-Wunused-variable] 1535 | static char ringRemap[256]; | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1554:7: warning: variable 'gcnt' set but not used [-Wunused-but-set-variable] 1554 | int gcnt = 0; | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1625:9: warning: unused variable 't' [-Wunused-variable] 1625 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1535:15: warning: unused variable 'ringRemap' [-Wunused-variable] 1535 | static char ringRemap[256]; /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:2051:15: warning: variable length arrays in C++ are a Clang extension [-Wvla-cxx-extension] 2051 | int g_hives[ngpus], n_hives[nnets]; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:2051:15: note: read of non-const variable 'ngpus' is not allowed in a constant expression /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:2034:7: note: declared here 2034 | int ngpus = system->nodes[GPU].count; | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:2051:31: warning: variable length arrays in C++ are a Clang extension [-Wvla-cxx-extension] 2051 | int g_hives[ngpus], n_hives[nnets]; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:2051:31: note: read of non-const variable 'nnets' is not allowed in a constant expression /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:2035:7: note: declared here 2035 | int nnets = system->nodes[NET].count; | ^ | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1554:7: warning: variable 'gcnt' set but not used [-Wunused-but-set-variable] 1554 | int gcnt = 0; | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1625:9: warning: unused variable 't' [-Wunused-variable] 1625 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:2032:15: warning: unused variable 'ringRemap' [-Wunused-variable] 2032 | static char ringRemap[256]; | ^~~~~~~~~ 2 warnings generated when compiling for gfx90a. /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1563:14: warning: variable length arrays in C++ are a Clang extension [-Wvla-cxx-extension] 1563 | int j, r[ngpus], g[/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1563:14: warning: variable length arrays in C++ are a Clang extension [-Wvla-cxx-extension] 1563 | int j, r[ngpus], g[ngpus]; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1563:14: note: read of non-const variable 'ngpus' is not allowed in a constant expression /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1538:7: note: declared here 1538 | int ngpus = system->nodes[GPU].count; | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1563:24: warning: variable length arrays in C++ are a Clang extension [-Wvla-cxx-extension] 1563 | int j, r[ngpus], g[ngpus]; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1563:24: note: read of non-const variable 'ngpus' is not allowed in a constant expression /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1538:7: note: declared here 1538 | int ngpus = system->nodes[GPU].count; | ^ ngpus]; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1563:14: note: read of non-const variable 'ngpus' is not allowed in a constant expression /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1538:7: note: declared here 1538 | int ngpus = system->nodes[GPU].count; | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1563:24: warning: variable length arrays in C++ are a Clang extension [-Wvla-cxx-extension] 1563 | int j, r[ngpus], g[ngpus]; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1563:24: note: read of non-const variable 'ngpus' is not allowed in a constant expression /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1538:7: note: declared here 1538 | int ngpus = system->nodes[GPU].count; | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1712:15: warning: unused variable 'ringRemap' [-Wunused-variable] 1712 | static char ringRemap[64]; | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1716:7: warning: unused variable 'ncpus' [-Wunused-variable] 1716 | int ncpus = system->nodes[CPU].count; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1809:9: warning: unused variable 't' [-Wunused-variable] 1809 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1535:15: warning: unused variable 'ringRemap' [-Wunused-variable] 1535 | static char ringRemap[256]; | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1554:7: warning: variable 'gcnt' set but not used [-Wunused-but-set-variable] 1554 | int/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc gcnt = 0; | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1625:9: warning: unused variable 't' [-Wunused-variable] 1625 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1535:15: warning: unused variable 'ringRemap' [-Wunused-variable] 1535 | static char ringRemap[256]; | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1554:7: warning: variable 'gcnt' set but not used [-Wunused-but-set-variable] 1554 | int gcnt = 0; | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1625:9: warning: unused variable 't' [-Wunused-variable] 1625 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ :1712:15: warning: unused variable 'ringRemap' [-Wunused-variable] 1712 | static char ringRemap[64]; | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1716:7: warning: unused variable 'ncpus' [-Wunused-variable] 1716 | int ncpus = system->nodes[CPU].count; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1809:9: warning: unused variable 't' [-Wunused-variable] 1809 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:22: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:24: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:222:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 222 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSyste:1892:11: warning: 'NUMA_CPUS' macro redefined [-Wmacro-redefined] 1892 | #define NUMA_CPUS 4 | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1530:11: note: previous definition is here 1530 | #define NUMA_CPUS 2 | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1895:11: warning: 'TOTAL_PERMUTE_COUNT' macro redefined [-Wmacro-redefined] 1895 | #define TOTAL_PERMUTE_COUNT (NUMA_PERMUTE_COUNT*NUMA_PERMUTE_COUNT*NUMA_PERMUTE_COUNT*NUMA_PERMUTE_COUNT) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1533:11: note: previous definition is here 1533 | #define TOTAL_PERMUTE_COUNT (NUMA_PERMUTE_COUNT*NUMA_PERMUTE_COUNT) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.ccm* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:233:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 233 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:245:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 245 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:268:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 268 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:279:13: warning: unused function 'isPow2' [-Wunused-function] 279 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:282:12: warning: unused function 'mirrorBits' [-Wunused-function] 282 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:25: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:75:21: warning: unused function 'xmlAlloc' [-Wunused-function] 75 | static ncclResult_t xmlAlloc(struct ncclXml** xml, int maxNodes) { | ^~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:110:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 110 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:117:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 117 | static ncc:1892:11: warning: 'NUMA_CPUS' macro redefined [-Wmacro-redefined] :18921927:14: warning: variable length arrays in C++ are a Clang extension [-Wvla-cxx-extension] 1927 | int j, r[ngpus], g[ngpus]; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1927:14: note: read of non-const variable 'ngpus' is not allowed in a constant expression /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1900:7: note: declared here 1900 | int ngpus = system->nodes[GPU].count; | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1927:24: warning: variable length arrays in C++ are a Clang extension [-Wvla-cxx-extension] 1927 | int j, r[ngpus], g[ngpus]; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1927:24: note: read of non-const variable 'ngpus' is not allowed in a constant expression /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1900:7: note: declared here 1900 | int ngpus = system->nodes[GPU].count; | ^ | #define NUMA_CPUS 4 | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1530:11: note: previous definition is here 1530 | #define NUMA_CPUS 2 | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1895:11: warning: 'TOTAL_PERMUTE_COUNT' macro redefined [-Wmacro-redefined] 1895 | #define TOTAL_PERMUTE_COUNT (NUMA_PERMUTE_COUNT*NUMA_PERMUTE_COUNT*NUMA_PERMUTE_COUNT*NUMA_PERMUTE_COUNT) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1533:11: note: previous definition is here 1533 | #define TOTAL_PERMUTE_COUNT (NUMA_PERMUTE_COUNT*NUMA_PERMUTE_COUNT) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:124:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 124 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:132:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 132 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:139:21: warning: unused function 'xmlFindTag' [-Wunused-function] 139 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:151:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 151 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:163:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 163 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:179:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 179 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:192:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 192 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:204:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 204 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:217:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 217 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:230:21: warning: unused function 'xmlSetAttrLong' [-Wunused-function] 230 | static ncclResult_t xmlSetAttrLong(struct ncclXmlNode* node, const char* attrName, const int64_t value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:243:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 243 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:255:21: warning: unused function 'xmlGetSub' [-Wunused-function] 255 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:281:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 281 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:288:21: warning: unused function 'xmlAddNode' [-Wunused-function] 288 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:310:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 310 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:323:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 323 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:353:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 353 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:366:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 366 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ :1927:14: warning: variable length arrays in C++ are a Clang extension [-Wvla-cxx-extension] 1927 | int j, r[ngpus], g[ngpus]; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1927:14: note: read of non-const variable 'ngpus' is not allowed in a constant expression /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1900:7: note: declared here 1900 | int ngpus = system->nodes[GPU].count; | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1927:24: warning: variable length arrays in C++ are a Clang extension [-Wvla-cxx-extension] 1927 | int j, r[ngpus], g[ngpus]; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1927:24: note: read of non-const variable 'ngpus' is not allowed in a constant expression /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1900:7: note: declared here 1900 | int ngpus = system->nodes[GPU].count; | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1897:15: warning: unused variable 'ringRemap' [-Wunused-variable] 1897 | static char ringRemap[256]; | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1918:7: warning: variable 'gcnt' set but not used [-Wunused-but-set-variable] 1918 | int gcnt = 0; | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1993:9: warning: unused variable 't' [-Wunused-variable] 1993 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1897:15: warning: unused variable 'ringRemap' [-Wunused-variable] 1897 | static char ringRemap[256]; | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1918:7: warning: variable 'gcnt' set but not used [-Wunused-but-set-variable] 1918 | int gcnt = 0; | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1993:9: warning: unused variable 't' [-Wunused-variable] 1993 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1712:15: warning: unused variable 'ringRemap' [-Wunused-variable] 1712 | static char ringRemap[64]; | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1716:7: warning: unused variable 'ncpus' [-Wunused-variable] 1716 | int ncpus = system->nodes[CPU].count; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1809:9: warning: unused variable 't' [-Wunused-variable] 1809 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1892:11: warning: 'NUMA_CPUS' macro redefined [-Wmacro-redefined] 1892 | #define NUMA_CPUS 4 | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1530:11: note: previous definition is here 1530 | #define NUMA_CPUS 2 | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1895:11: warning: 'TOTAL_PERMUTE_COUNT' macro redefined [-Wmacro-redefined] 1895 | #define TOTAL_PERMUTE_COUNT (NUMA_PERMUTE_COUNT*NUMA_PERMUTE_COUNT*NUMA_PERMUTE_COUNT*NUMA_PERMUTE_COUNT) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1533:11: note: previous definition is here 1533 | #define TOTAL_PERMUTE_COUNT (NUMA_PERMUTE_COUNT*NUMA_PERMUTE_COUNT) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.ccIn file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:22: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:24: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:222:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 222 | static ncclResult_t ncclTopoRa/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.ccn:kT2051o:I15n:d ewarning: x(variable length arrays in C++ are a Clang extension [-Wvla-cxx-extension]s truct ncclTopoSys t2051e | m * isnyts tge_mh,i vienst[ nrgapnuks,] ,i nnt_*h iivneds[enxn)e t{s ] ;| ^~~~~~~~~~~~~~~~~~~ | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc::2332051::2115:: warning: note: unused function 'ncclTopoDevToRank' [-Wunused-function]read of non-const variable 'ngpus' is not allowed in a constant expression /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:2034:7: note: declared here 233 | stati c2034 | n c cilnRte snuglptu_st =n cscylsTtoepmo->DneovTdoeRsa[nGkP(Us]t.rcuoucnt tn; c c| l ^T opoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc245::205121::31 :warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function]warning: variable length arrays in C++ are a Clang extension [-Wvla-cxx-extension] 2051 | int g_hives[ngpus], n_hives[nnets]; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:2051:31: note: read of non-const variable 'nnets' is not allowed in a constant expression /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:2035:7: note: declared here 2035 | int nnets = system->nodes[NET].count; | ^ :1712:15: warning: unused variable 'ringRemap' [-Wunused-variable] 1712 | static char ringRemap[64]; | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1716:7: warning: unused variable 'ncpus' [-Wunused-variable] 1716 | int ncpus = system->nodes[CPU].count; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1809:9: warning: unused variable 't' [-Wunused-variable] 1809 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:2032:15: warning: unused variable 'ringRemap' [-Wunused-variable] 2032 | static char /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.ccri:n2051g:R15e:m awarning: pvariable length arrays in C++ are a Clang extension [-Wvla-cxx-extension][ 256]; | ^~~~~~~~~ 2051 | int g_hives[ngpus], n_hives[nnets]; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:2051:15: note: read of non-const variable 'ngpus' is not allowed in a constant expression /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:2034:7: note: declared here 2034 | int ngpus = system->nodes[GPU].count; | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:2051:31: warning: variable length arrays in C++ are a Clang extension [-Wvla-cxx-extension] 2051 | int g_hives[ngpus], n_hives[nnets]; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:2051:31: note: read of non-const variable 'nnets' is not allowed in a constant expression /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:2035:7: note: declared here 2035 | int nnets = system->nodes[NET].count; | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1892:11: warning: 'NUMA_CPUS' macro redefined [-Wmacro-redefined] 1892 | #define NUMA_CPUS 4 | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1530:11: note: previous definition is here 1530 | #define NUMA_CPUS 2 | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1895:11: warning: 'TOTAL_PERMUTE_COUNT' macro redefined [-Wmacro-redefined] 1895 | #define TOTAL_PERMUTE_COUNT (NUMA_PERMUTE_COUNT*NUMA_PERMUTE_COUNT*NUMA_PERMUTE_COUNT*NUMA_PERMUTE_COUNT) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1533:11: note: previous definition is here 1533 | #define TOTAL_PERMUTE_COUNT (NUMA_PERMUTE_COUNT*NUMA_PERMUTE_COUNT) | ^ 245 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:268:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 268 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:279:13: warning: unused function 'isPow2' [-Wunused-function] 279 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:282:12: warning: unused function 'mirrorBits' [-Wunused-function] 282 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:25: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:75:21: warning: unused function 'xmlAlloc' [-Wunused-function] 75 | static ncclResult_t xmlAlloc(struct ncclXml** xml, int maxNodes) { | ^~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:110:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 110 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:117:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 117 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:124:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 124 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:132:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 132 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:139:21: warning: unused function 'xmlFindTag' [-Wunused-function] 139 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:151:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 151 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:163:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 163 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.ccst:r1927uc:t14 :n cwarning: cvariable length arrays in C++ are a Clang extension [-Wvla-cxx-extension]l XmlNode** node, c1927o | n s t c ihnatr *j ,a trt[rnNgapmues,] ,c ogn[sntg pcuhsar]*; a t| t ^~~~~r Value) { /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc :| 1927 ^~~~~~~~~~~~: 14: note: read of non-const variable 'ngpus' is not allowed in a constant expression /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1900:7: note: declared here 1900 | int /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.hn:g179p:u21s: =warning: unused function 'xmlSetAttr' [-Wunused-function]s ystem->nodes[GPU].count ;179 | s| t ^a tic ncclResult_t xmlSetAttr(struct ncclXmlNode/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc*: 1927n:o24d:e ,warning: variable length arrays in C++ are a Clang extension [-Wvla-cxx-extension]c onst char *1927 | a t t r Nianmte ,j ,c orn[sntg pcuhsa]r,* gv[anlgupeu)s ]{; | | ^~~~~~~~~~ ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h1927::19224::21 :note: read of non-const variable 'ngpus' is not allowed in a constant expressionwarning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1900:7: note: declared here 1900 | 192 | sitnatt incg pnucsc l=R essyusltte_mt- >xnmoldSeest[AGtPtUr]I.fcUonusnett;( s t| r ^u ct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:204:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 204 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:217:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 217 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:230:21: warning: unused function 'xmlSetAttrLong' [-Wunused-function] 230 | static ncclResult_t xmlSetAttrLong(struct ncclXmlNode* node, const char* attrName, const int64_t value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:243:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 243 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:255:21: warning: unused function 'xmlGetSub' [-Wunused-function] 255 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:281:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 281 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:288:21: warning: unused function 'xmlAddNode' [-Wunused-function] 288 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:310:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 310 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:323:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 323 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:353:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 353 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:366:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 366 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1927:14: warning: variable length arrays in C++ are a Clang extension [-Wvla-cxx-extension] 1927 | int j, r[ngpus], g[ngpus]; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1927:14: note: read of non-const variable 'ngpus' is not allowed in a constant expression /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1900:7: note: declared here 1900 | int ngpus = system->nodes[GPU].count; | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1927:24: warning: variable length arrays in C++ are a Clang extension [-Wvla-cxx-extension] 1927 | int j, r[ngpus], g[ngpus]; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1927:24: note: read of non-const variable 'ngpus' is not allowed in a constant expression /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1900:7: note: declared here 1900 | int ngpus = system->nodes[GPU].count; | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:2032:15: warning: unused variable 'ringRemap' [-Wunused-variable] 2032 | static char ringRemap[256]; | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1897:15: warning: unused variable 'ringRemap' [-Wunused-variable] 1897 | static char ringRemap[256]; | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1918:7: warning: variable 'gcnt' set but not used [-Wunused-but-set-variable] 1918 | int gcnt = 0; | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1993:9: warning: unused variable 't' [-Wunused-variable] 1993 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:2051:15: warning: variable length arrays in C++ are a Clang extension [-Wvla-cxx-extension] 2051 | int g_hives[ngpus], n_hives[nnets]; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:2051:15: note: read of non-const variable 'ngpus' is not allowed in a constant expression /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:2034:7: note: declared here 2034 | int ngpus = system->nodes[GPU].count; | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:2051:31: warning: variable length arrays in C++ are a Clang extension [-Wvla-cxx-extension] 2051 | int g_hives[ngpus], n_hives[nnets]; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:2051:31: note: read of non-const variable 'nnets' is not allowed in a constant expression /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:2035:7: note: declared here 2035 | int nnets = system->nodes[NET].count; | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:2032:15: warning: unused variable 'ringRemap' [-Wunused-variable] 2032 | static char ringRemap[256]; | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1897:15: warning: unused variable 'ringRemap' [-Wunused-variable] 1897 | static char ringRemap[256]; | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1918:7: warning: variable 'gcnt' set but not used [-Wunused-but-set-variable] 1918 | int gcnt = 0; | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:1993:9: warning: unused variable 't' [-Wunused-variable] 1993 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:2051:15: warning: variable length arrays in C++ are a Clang extension [-Wvla-cxx-extension] 2051 | int g_hives[ngpus], n_hives[nnets]; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:2051:15: note: read of non-const variable 'ngpus' is not allowed in a constant expression /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:2034:7: note: declared here 2034 | int ngpus = system->nodes[GPU].count; | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:2051:31: warning: variable length arrays in C++ are a Clang extension [-Wvla-cxx-extension] 2051 | int g_hives[ngpus], n_hives[nnets]; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:2051:31: note: read of non-const variable 'nnets' is not allowed in a constant expression /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:2035:7: note: declared here 2035 | int nnets = system->nodes[NET].count; | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:2032:15: warning: unused variable 'ringRemap' [-Wunused-variable] 2032 | static char ringRemap[256]; | ^~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:22: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:24: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:222:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 222 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:233:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 233 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:245:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 245 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:268:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 268 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:279:13: warning: unused function 'isPow2' [-Wunused-function] 279 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:282:12: warning: unused function 'mirrorBits' [-Wunused-function] 282 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:25: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:75:21: warning: unused function 'xmlAlloc' [-Wunused-function] 75 | static ncclResult_t xmlAlloc(struct ncclXml** xml, int maxNodes) { | ^~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:110:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 110 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:117:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 117 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:124:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 124 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:132:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 132 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:139:21: warning: unused function 'xmlFindTag' [-Wunused-function] 139 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:151:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 151 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:163:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 163 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:179:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 179 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:192:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 192 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:204:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 204 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:217:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 217 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:230:21: warning: unused function 'xmlSetAttrLong' [-Wunused-function] 230 | static ncclResult_t xmlSetAttrLong(struct ncclXmlNode* node, const char* attrName, const int64_t value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:243:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 243 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:255:21: warning: unused function 'xmlGetSub' [-Wunused-function] 255 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:281:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 281 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:288:21: warning: unused function 'xmlAddNode' [-Wunused-function] 288 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:310:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 310 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:323:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 323 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:353:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 353 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:366:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 366 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:22: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:24: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:222:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 222 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:233:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 233 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:245:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 245 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:268:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 268 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:279:13: warning: unused function 'isPow2' [-Wunused-function] 279 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:282:12: warning: unused function 'mirrorBits' [-Wunused-function] 282 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:25: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:75:21: warning: unused function 'xmlAlloc' [-Wunused-function] 75 | static ncclResult_t xmlAlloc(struct ncclXml** xml, int maxNodes) { | ^~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:110:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 110 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:117:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 117 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:124:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 124 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:132:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 132 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:139:21: warning: unused function 'xmlFindTag' [-Wunused-function] 139 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:151:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 151 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:163:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 163 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:179:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 179 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:192:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 192 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:204:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 204 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:217:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 217 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:230:21: warning: unused function 'xmlSetAttrLong' [-Wunused-function] 230 | static ncclResult_t xmlSetAttrLong(struct ncclXmlNode* node, const char* attrName, const int64_t value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:243:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 243 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:255:21: warning: unused function 'xmlGetSub' [-Wunused-function] 255 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:281:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 281 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:288:21: warning: unused function 'xmlAddNode' [-Wunused-function] 288 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:310:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 310 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:323:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 323 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:353:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 353 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:366:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 366 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ [ 47%] Building CXX object CMakeFiles/rccl.dir/hipify/src/graph/topo.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60300 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/src/graph/topo.cc.o -MF CMakeFiles/rccl.dir/hipify/src/graph/topo.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/graph/topo.cc.o -c /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.cc In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:22: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:24: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:222:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 222 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:233:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 233 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:245:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 245 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:268:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 268 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:279:13: warning: unused function 'isPow2' [-Wunused-function] 279 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:282:12: warning: unused function 'mirrorBits' [-Wunused-function] 282 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:25: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:75:21: warning: unused function 'xmlAlloc' [-Wunused-function] 75 | static ncclResult_t xmlAlloc(struct ncclXml** xml, int maxNodes) { | ^~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:110:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 110 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:117:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 117 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:124:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 124 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:132:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 132 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:139:21: warning: unused function 'xmlFindTag' [-Wunused-function] 139 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:151:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 151 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:163:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 163 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:179:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 179 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:192:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 192 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:204:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 204 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:217:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 217 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:230:21: warning: unused function 'xmlSetAttrLong' [-Wunused-function] 230 | static ncclResult_t xmlSetAttrLong(struct ncclXmlNode* node, const char* attrName, const int64_t value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:243:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 243 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:255:21: warning: unused function 'xmlGetSub' [-Wunused-function] 255 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:281:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 281 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:288:21: warning: unused function 'xmlAddNode' [-Wunused-function] 288 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:310:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 310 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:323:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 323 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:353:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 353 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:366:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 366 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:22: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:24: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:222:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 222 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:233:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 233 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:245:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 245 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:268:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 268 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:279:13: warning: unused function 'isPow2' [-Wunused-function] 279 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:282:12: warning: unused function 'mirrorBits' [-Wunused-function] 282 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:25: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:75:21: warning: unused function 'xmlAlloc' [-Wunused-function] 75 | static ncclResult_t xmlAlloc(struct ncclXml** xml, int maxNodes) { | ^~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:110:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 110 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:117:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 117 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* nodeIn file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:22: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:24: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:222:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 222 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:233:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 233 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:245:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 245 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:268:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 268 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:279:13: warning: unused function 'isPow2' [-Wunused-function] 279 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:282:12: warning: unused function 'mirrorBits' [-Wunused-function] 282 | static int mirrorBits(int val,, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:124:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 124 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:132:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 132 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:139:21: warning: unused function 'xmlFindTag' [-Wunused-function] 139 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, str int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:25: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:75:21: warning: unused function 'xmlAlloc' [-Wunused-function] 75 | static ncclResult_t xmlAlloc(struct ncclXml** xml, int maxNodes) { | ^~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:110:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 110 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:117:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 117 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, uct ncclXmlNode** node) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:151:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] int defa u151l | tsVtaaltuiec) n{c c l| R ^~~~~~~~~~~~~~~~~~~~e sult_t xm/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.hl:F124i:n21d:N ewarning: xunused function 'xmlGetAttrLong' [-Wunused-function]t Tag(struct ncc l124X | mslt*a txicm ln,c clResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:132:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 132 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:139:21: warning: unused function 'xmlFindTag' [-Wunused-function] const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:163:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 163 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:179:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 179 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:192:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 192 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:204:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 204 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:217:21: warning: 139 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:151:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 151 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:163:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 163 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:179:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 179 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:192:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 192 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:204:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 204 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:217:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 217 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:230:21: warning: unused function 'xmlSetAttrLong' [-Wunused-function] 230 | static ncclResult_t xmlSetAttrLong(struct ncclXmlNode* node, const char* attrName, const int64_t value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:243:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 243 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:255:21: warning: unused function 'xmlGetSub' [-Wunused-function] 255 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:281:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 281 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:288:21: warning: unused function 'xmlAddNode' [-Wunused-function] 288 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:310:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 310 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:323:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 323 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:353:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 353 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:366:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 366 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ unused function 'xmlSetAttrFloat' [-Wunused-function] 217 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:230:21: warning: unused function 'xmlSetAttrLong' [-Wunused-function] 230 | static ncclResult_t xmlSetAttrLong(struct ncclXmlNode* node, const char* attrName, const int64_t value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:243:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 243 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:255:21: warning: unused function 'xmlGetSub' [-Wunused-function] 255 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:281:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 281 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:288:21: warning: unused function 'xmlAddNode' [-Wunused-function] 288 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:310:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 310 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:323:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 323 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:353:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 353 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:366:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 366 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:22: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:24: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:222:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 222 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:233:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 233 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:245:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 245 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:268:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 268 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:279:13: warning: unused function 'isPow2' [-Wunused-function] 279 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:282:12: warning: unused function 'mirrorBits' [-Wunused-function] 282 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/rome_models.cc:25: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:75:21: warning: unused function 'xmlAlloc' [-Wunused-function] 75 | static ncclResult_t xmlAlloc(struct ncclXml** xml, int maxNodes) { | ^~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:110:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 110 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:117:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 117 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:124:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 124 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:132:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 132 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:139:21: warning: unused function 'xmlFindTag' [-Wunused-function] 139 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:151:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 151 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:163:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 163 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:179:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 179 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:192:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 192 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:204:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 204 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:217:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 217 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:230:21: warning: unused function 'xmlSetAttrLong' [-Wunused-function] 230 | static ncclResult_t xmlSetAttrLong(struct ncclXmlNode* node, const char* attrName, const int64_t value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:243:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 243 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:255:21: warning: unused function 'xmlGetSub' [-Wunused-function] 255 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:281:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 281 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:288:21: warning: unused function 'xmlAddNode' [-Wunused-function] 288 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:310:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 310 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:323:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 323 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:353:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 353 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:366:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 366 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ 49 warnings generated when compiling for gfx1100. 49 warnings generated when compiling for gfx90a. 49 warnings generated when compiling for gfx1201. 49 warnings generated when compiling for gfx1101. 49 warnings generated when compiling for gfx1102. 49 warnings generated when compiling for gfx90a. 49 warnings generated when compiling for gfx1200. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/search.cc:8: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/search.cc:8: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/search.cc:8: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/search.cc:8: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/search.cc:8: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/search.cc:8: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/search.cc:8: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/search.cc:8: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ 49 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.cc:9: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.cc:9: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.cc:9: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.cc:9: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.cc:9: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.cc:9: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.cc:9: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ [ 48%] Building CXX object CMakeFiles/rccl.dir/hipify/src/graph/trees.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60300 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/src/graph/trees.cc.o -MF CMakeFiles/rccl.dir/hipify/src/graph/trees.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/graph/trees.cc.o -c /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/trees.cc In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/search.cc:8: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/search.cc:11: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:258:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 258 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:268:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 268 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:279:13: warning: unused function 'isPow2' [-Wunused-function] 279 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:282:12: warning: unused function 'mirrorBits' [-Wunused-function] 282 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/search.cc:12: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:117:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 117 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:124:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 124 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:139:21: warning: unused function 'xmlFindTag' [-Wunused-function] 139 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:151:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 151 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:163:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 163 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:192:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 192 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:243:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 243 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:255:21: warning: unused function 'xmlGetSub' [-Wunused-function] 255 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:281:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 281 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:310:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 310 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:323:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 323 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.cc:9: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/search.cc:8: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/search.cc:11: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:258:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 258 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:268:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 268 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:279:13: warning: unused function 'isPow2' [-Wunused-function] 279 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:282:12: warning: unused function 'mirrorBits' [-Wunused-function] 282 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/search.cc:12: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:117:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 117 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:124:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 124 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:139:21: warning: unused function 'xmlFindTag' [-Wunused-function] 139 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:151:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 151 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:163:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 163 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:192:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 192 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:243:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 243 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:255:21: warning: unused function 'xmlGetSub' [-Wunused-function] 255 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:281:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 281 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:310:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 310 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:323:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 323 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/search.cc:8: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/search.cc:11: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:258:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 258 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:268:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 268 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:279:13: warning: unused function 'isPow2' [-Wunused-function] 279 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:282:12: warning: unused function 'mirrorBits' [-Wunused-function] 282 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/search.cc:12: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:117:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 117 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:124:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 124 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:139:21: warning: unused function 'xmlFindTag' [-Wunused-function] 139 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:151:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 151 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:163:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 163 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:192:21In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/search.cc:8: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/search.cc:11: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:258:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 258 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:268:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 268 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:279:13: warning: unused function 'isPow2' [-Wunused-function] 279 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:282:12: warning: unused function 'mirrorBits' [-Wunused-function] 282 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/search.cc:12: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:117:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 117 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:124:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 124 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:139:21: warning: unused function 'xmlFindTag' [-Wunused-function] 139 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:151:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 151 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:163:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 163 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:192:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 192 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:243:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 243 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:255:21: warning: unused function 'xmlGetSub' [-Wunused-function] 255 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:281:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 281 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:310:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 310 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:323:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 323 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ : warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 192 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:243:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 243 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:255:21: warning: unused function 'xmlGetSub' [-Wunused-function] 255 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:281:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 281 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:310:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 310 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:323:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 323 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/search.cc:8: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/search.cc:11: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:258:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 258 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:268:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 268 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:279:13: warning: unused function 'isPow2' [-Wunused-function] 279 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:282:12: warning: unused function 'mirrorBits' [-Wunused-function] 282 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/search.cc:12: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:117:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 117 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:124:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 124 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:139:21: warning: unused function 'xmlFindTag' [-Wunused-function] 139 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:151:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 151 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:163:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 163 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:192:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 192 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:243:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 243 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:255:21: warning: unused function 'xmlGetSub' [-Wunused-function] 255 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:281:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 281 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:310:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 310 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:323:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 323 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/search.cc:8: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/search.cc:11: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:258:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 258 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:268:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 268 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:279:13: warning: unused function 'isPow2' [-Wunused-function] 279 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:282:12: warning: unused function 'mirrorBits' [-Wunused-function] 282 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/search.cc:12: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:117:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 117 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:124:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 124 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:139:21: warning: unused function 'xmlFindTag' [-Wunused-function] 139 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:151:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 151 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:163:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 163 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:192:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 192 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:243:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 243 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:255:21: warning: unused function 'xmlGetSub' [-Wunused-function] 255 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:281:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 281 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:310:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 310 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:323:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 323 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/search.cc:8: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/search.cc:11: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:258:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 258 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:268:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 268 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:279:13: warning: unused function 'isPow2' [-Wunused-function] 279 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:282:12: warning: unused function 'mirrorBits' [-Wunused-function] 282 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/search.cc:12: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:117:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 117 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:124:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 124 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:139:21: warning: unused function 'xmlFindTag' [-Wunused-function] 139 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:151:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 151 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:163:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 163 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:192:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 192 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:243:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 243 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:255:21: warning: unused function 'xmlGetSub' [-Wunused-function] 255 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:281:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 281 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:310:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 310 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:323:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 323 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/search.cc:8: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/search.cc:11: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:258:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 258 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:268:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 268 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:279:13: warning: unused function 'isPow2' [-Wunused-function] 279 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:282:12: warning: unused function 'mirrorBits' [-Wunused-function] 282 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/search.cc:12: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:117:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 117 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:124:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 124 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:139:21: warning: unused function 'xmlFindTag' [-Wunused-function] 139 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:151:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 151 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:163:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 163 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:192:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 192 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:243:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 243 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:255:21: warning: unused function 'xmlGetSub' [-Wunused-function] 255 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:281:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 281 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:310:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 310 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:323:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 323 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.cc:8: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.cc:10: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:233:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 233 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:245:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 245 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:268:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 268 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.cc:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK2( warningcso generatedmm when compiling for -gfx90a>. ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.cc:17: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:124:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 124 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:151:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 151 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:163:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 163 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:179:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 179 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:192:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 192 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:217:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 217 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:230:21: warning: unused function 'xmlSetAttrLong' [-Wunused-function] 230 | static ncclResult_t xmlSetAttrLong(struct ncclXmlNode* node, const char* attrName, const int64_t value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:243:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 243 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:281:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 281 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:310:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 310 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:323:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 323 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:366:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 366 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.cc:8: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.cc:10: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:233:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 233 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:245:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 245 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:268:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 268 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.cc:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.cc:17: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:124:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 124 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:151:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 151 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:163:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 163 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:179:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 179 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:192:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 192 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:217:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 217 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:230:21: warning: unused function 'xmlSetAttrLong' [-Wunused-function] 230 | static ncclResult_t xmlSetAttrLong(struct ncclXmlNode* node, const char* attrName, const int64_t value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:243:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 243 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:281:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 281 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:310:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 310 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:323:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 323 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:366:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 366 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.cc:8: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.cc:10: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:233:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 233 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:245:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 245 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:268:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 268 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.cc:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.cc:17: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:124:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 124 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:151:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 151 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:163:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 163 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:179:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 179 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:192:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 192 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:217:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 217 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:230:21: warning: unused function 'xmlSetAttrLong' [-Wunused-function] 230 | static ncclResult_t xmlSetAttrLong(struct ncclXmlNode* node, const char* attrName, const int64_t value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:243:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 243 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:281:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 281 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:310:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 310 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:323:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 323 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:366:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 366 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ 17 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.cc:8: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.cc:10: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:233:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 233 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:245:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 245 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:268:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 268 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.cc:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.cc:17: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:124:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 124 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:151:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 151 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:163:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 163 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:179:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 179 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:192:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 192 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:217:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 217 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:230:21: warning: unused function 'xmlSetAttrLong' [-Wunused-function] 230 | static ncclResult_t xmlSetAttrLong(struct ncclXmlNode* node, const char* attrName, const int64_t value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:243:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 243 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:281:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 281 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:310:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 310 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:323:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 323 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:366:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 366 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.cc:8: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.cc:10: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:233:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 233 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:245:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 245 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:268:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 268 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.cc:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.cc:17: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:124:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 124 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:151:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 151 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:163:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 163 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:179:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 179 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:192:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 192 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:217:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 217 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:230:21: warning: unused function 'xmlSetAttrLong' [-Wunused-function] 230 | static ncclResult_t xmlSetAttrLong(struct ncclXmlNode* node, const char* attrName, const int64_t value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:243:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 243 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:281:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 281 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:310:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 310 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:323:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 323 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:366:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 366 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.cc:8: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.cc:8: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.cc:10: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:233:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 233 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:245:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 245 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:268:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 268 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.cc:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.cc:17: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:124:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 124 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:151:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 151 | static ncclResult_t xmlFindNextTag( | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.cc:10: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:233:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 233 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:245:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 245 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:268:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 268 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.cc:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclRstruct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:163:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 163 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:179:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 179 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:192:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 192 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:217:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 217 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:230:21: warning: unused function 'xmlSetAttrLong' [-Wunused-function] 230 | static ncclResult_t xmlSetAttrLong(struct ncclXmlNode* node, const char* attrName, const int64_t value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:243:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 243 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:281:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 281 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:310:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 310 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:323:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 323 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:366:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 366 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ esult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.cc:17: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:124:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 124 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:151:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 151 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:163:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 163 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:179:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 179 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:192:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 192 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:217:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 217 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:230:21: warning: unused function 'xmlSetAttrLong' [-Wunused-function] 230 | static ncclResult_t xmlSetAttrLong(struct ncclXmlNode* node, const char* attrName, const int64_t value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:243:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 243 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:281:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 281 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:310:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 310 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:323:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 323 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:366:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 366 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ 17 warnings generated when compiling for gfx90a. 17 warnings generated when compiling for gfx1200. 2 warnings generated when compiling for gfx1100. 2 warnings generated when compiling for gfx90a. 2 warnings generated when compiling for gfx1101. 17 warnings generated when compiling for gfx1101. 2 warnings generated when compiling for gfx1201. 17 warnings generated when compiling for gfx1201. 29 warnings generated when compiling for gfx90a. 2 warnings generated when compiling for gfx1200. 17 warnings generated when compiling for gfx1100. 17 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.cc:8: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.cc:10: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:233:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 233 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:245:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 245 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:268:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 268 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.cc:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.cc:17: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:124:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 124 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:151:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 151 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:163:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 163 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:179:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 179 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:192:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 192 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:217:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 217 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:230:21: warning: unused function 'xmlSetAttrLong' [-Wunused-function] 230 | static ncclResult_t xmlSetAttrLong(struct ncclXmlNode* node, const char* attrName, const int64_t value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:243:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 243 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:281:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 281 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:310:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 310 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:323:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 323 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:366:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 366 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ 29 warnings generated when compiling for gfx1100. 29 warnings generated when compiling for gfx1102. 2 warnings generated when compiling for gfx1102. 29 warnings generated when compiling for gfx1200. 29 warnings generated when compiling for gfx90a. 29 warnings generated when compiling for gfx1201. [ 48%] Building CXX object CMakeFiles/rccl.dir/hipify/src/graph/tuning.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60300 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/src/graph/tuning.cc.o -MF CMakeFiles/rccl.dir/hipify/src/graph/tuning.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/graph/tuning.cc.o -c /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/tuning.cc 29 warnings generated when compiling for gfx1101. 17 warnings generated when compiling for host. 29 warnings generated when compiling for host. [ 48%] Building CXX object CMakeFiles/rccl.dir/hipify/src/graph/xml.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60300 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/src/graph/xml.cc.o -MF CMakeFiles/rccl.dir/hipify/src/graph/xml.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/graph/xml.cc.o -c /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.cc [ 48%] Building CXX object CMakeFiles/rccl.dir/hipify/src/misc/alt_rsmi.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60300 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/src/misc/alt_rsmi.cc.o -MF CMakeFiles/rccl.dir/hipify/src/misc/alt_rsmi.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/misc/alt_rsmi.cc.o -c /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/alt_rsmi.cc [ 49%] Building CXX object CMakeFiles/rccl.dir/hipify/src/misc/archinfo.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60300 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/src/misc/archinfo.cc.o -MF CMakeFiles/rccl.dir/hipify/src/misc/archinfo.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/misc/archinfo.cc.o -c /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/archinfo.cc In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/tuning.cc:9: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/tuning.cc:9: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/tuning.cc:9: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/tuning.cc:9: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/tuning.cc:9: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/tuning.cc:9: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/tuning.cc:338:10: warning: unused variable 'llMaxBw' [-Wunused-variable] 338 | double llMaxBw = llMaxBws[index1][index2]; | ^~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/tuning.cc:339:10: warning: unused variable 'perChMaxTreeBw' [-Wunused-variable] 339 | double perChMaxTreeBw = perChMaxTreeBws[compCapIndex][index2]; | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/tuning.cc:340:10: warning: unused variable 'perChMaxRingLL128Bw' [-Wunused-variable] 340 | double perChMaxRingLL128Bw = perChMaxRingLL128Bws[compCapIndex][index2]; | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/tuning.cc:341:10: warning: unused variable 'perChMaxTreeLL128Bw' [-Wunused-variable] 341 | double perChMaxTreeLL128Bw = perChMaxTreeLL128Bws[compCapIndex][index2]; | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/tuning.cc:344:9: warning: unused variable 'ppn' [-Wunused-variable] 344 | float ppn = (float)nRanks / nNodes; // if ppn < 2, then we are sending/receiving at the same GPU through the NIC, apply some bw discount | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/tuning.cc:338:10: warning: unused variable 'llMaxBw' [-Wunused-variable] 338 | double llMaxBw = llMaxBws[index1][index2]; | ^~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/tuning.cc:339:10: warning: unused variable 'perChMaxTreeBw' [-Wunused-variable] 339 | double perChMaxTreeBw = perChMaxTreeBws[compCapIndex][index2]; | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/tuning.cc:340:10: warning: unused variable 'perChMaxRingLL128Bw' [-Wunused-variable] 340 | double perChMaxRingLL128Bw = perChMaxRingLL128Bws[compCapIndex][index2]; | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/tuning.cc:341:10: warning: unused variable 'perChMaxTreeLL128Bw' [-Wunused-variable] 341 | double perChMaxTreeLL128Bw = perChMaxTreeLL128Bws[compCapIndex][index2]; | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/tuning.cc:344:9: warning: unused variable 'ppn' [-Wunused-variable] 344 | float ppn = (float)nRanks / nNodes; // if ppn < 2, then we are sending/receiving at the same GPU through the NIC, apply some bw discount | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/tuning.cc:338:10: warning: unused variable 'llMaxBw' [-Wunused-variable] 338 | double llMaxBw = llMaxBws[index1][index2]; | ^~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/tuning.cc:339:10: warning: unused variable 'perChMaxTreeBw' [-Wunused-variable] 339 | double perChMaxTreeBw = perChMaxTreeBws[compCapIndex][index2]; | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/tuning.cc:340:10: warning: unused variable 'perChMaxRingLL128Bw' [-Wunused-variable] 340 | double perChMaxRingLL128Bw = perChMaxRingLL128Bws[compCapIndex][index2]; | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/tuning.cc:341:10: warning: unused variable 'perChMaxTreeLL128Bw' [-Wunused-variable] 341 | double perChMaxTreeLL128Bw = perChMaxTreeLL128Bws[compCapIndex][index2]; | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/tuning.cc:344:9: warning: unused variable 'ppn' [-Wunused-variable] 344 | float ppn = (float)nRanks / nNodes; // if ppn < 2, then we are sending/receiving at the same GPU through the NIC, apply some bw discount | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/tuning.cc:338:10: warning: unused variable 'llMaxBw' [-Wunused-variable] 338 | double llMaxBw = llMaxBws[index1][index2]; | ^~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/tuning.cc:339:10: warning: unused variable 'perChMaxTreeBw' [-Wunused-variable] 339 | double perChMaxTreeBw = perChMaxTreeBws[compCapIndex][index2]; | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/tuning.cc:340:10: warning: unused variable 'perChMaxRingLL128Bw' [-Wunused-variable] 340 | double perChMaxRingLL128Bw = perChMaxRingLL128Bws[compCapIndex][index2]; | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/tuning.cc:341:10: warning: unused variable 'perChMaxTreeLL128Bw' [-Wunused-variable] 341 | double perChMaxTreeLL128Bw = perChMaxTreeLL128Bws[compCapIndex][index2]; | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/tuning.cc:344:9: warning: unused variable 'ppn' [-Wunused-variable] 344 | float ppn = (float)nRanks / nNodes; // if ppn < 2, then we are sending/receiving at the same GPU through the NIC, apply some bw discount | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/tuning.cc:9: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/tuning.cc:9: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/tuning.cc:11: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:211:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 211 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:222:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 222 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:233:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 233 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:245:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 245 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:258:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 258 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:268:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 268 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:279:13: warning: unused function 'isPow2' [-Wunused-function] 279 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:282:12: warning: unused function 'mirrorBits' [-Wunused-function] 282 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/tuning.cc:626:14: warning: unused variable 'treeCorrectionFactor' [-Wunused-variable] 626 | static float treeCorrectionFactor[NCCL_NUM_PROTOCOLS][23] = { | ^~~~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/tuning.cc:11: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:211:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 211 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:222:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 222 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:233:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 233 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:245:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 245 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:258:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 258 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:268:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 268 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:279:13: warning: unused function 'isPow2' [-Wunused-function] 279 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:282:12: warning: unused function 'mirrorBits' [-Wunused-function] 282 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/tuning.cc:626:14: warning: unused variable 'treeCorrectionFactor' [-Wunused-variable] 626 | static float treeCorrectionFactor[NCCL_NUM_PROTOCOLS][23] = { | ^~~~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/tuning.cc:11: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:211:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 211 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:222:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 222 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:233:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 233 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:245:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 245 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:258:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 258 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:268:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 268 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:279:13: warning: unused function 'isPow2' [-Wunused-function] 279 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:282:12: warning: unused function 'mirrorBits' [-Wunused-function] 282 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/tuning.cc:626:14: warning: unused variable 'treeCorrectionFactor' [-Wunused-variable] 626 | static float treeCorrectionFactor[NCCL_NUM_PROTOCOLS][23] = { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/tuning.cc:338:10: warning: unused variable 'llMaxBw' [-Wunused-variable] 338 | double llMaxBw = llMaxBws[index1][index2]; | ^~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/tuning.cc:339:10: warning: unused variable 'perChMaxTreeBw' [-Wunused-variable] 339 | double perChMaxTreeBw = perChMaxTreeBws[compCapIndex][index2]; | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/tuning.cc:340:10: warning: unused variable 'perChMaxRingLL128Bw' [-Wunused-variable] 340 | double perChMaxRingLL128Bw = perChMaxRingLL128Bws[compCapIndex][index2]; | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/tuning.cc:341:10: warning: unused variable 'perChMaxTreeLL128Bw' [-Wunused-variable] 341 | double perChMaxTreeLL128Bw = perChMaxTreeLL128Bws[compCapIndex][index2]; | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/tuning.cc:344:9: warning: unused variable 'ppn' [-Wunused-variable] 344 | float ppn = (float)nRanks / nNodes; // if ppn < 2, then we are sending/receiving at the same GPU through the NIC, apply some bw discount | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/tuning.cc:11: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:211:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 211 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:222:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 222 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:233:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 233 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:245:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 245 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:258:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 258 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:268:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 268 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:279:13: warning: unused function 'isPow2' [-Wunused-function] 279 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:282:12: warning: unused function 'mirrorBits' [-Wunused-function] 282 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/tuning.cc:626:14: warning: unused variable 'treeCorrectionFactor' [-Wunused-variable] 626 | static float treeCorrectionFactor[NCCL_NUM_PROTOCOLS][23] = { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/tuning.cc:338:10: warning: unused variable 'llMaxBw' [-Wunused-variable] 338 | double llMaxBw = llMaxBws[index1][index2]; | ^~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/tuning.cc:339:10: warning: unused variable 'perChMaxTreeBw' [-Wunused-variable] 339 | double perChMaxTreeBw = perChMaxTreeBws[compCapIndex][index2]; | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/tuning.cc:340:10: warning: unused variable 'perChMaxRingLL128Bw' [-Wunused-variable] 340 | double perChMaxRingLL128Bw = perChMaxRingLL128Bws[compCapIndex][index2]; | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/tuning.cc:341:10: warning: unused variable 'perChMaxTreeLL128Bw' [-Wunused-variable] 341 | double perChMaxTreeLL128Bw = perChMaxTreeLL128Bws[compCapIndex][index2]; | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/tuning.cc:344:9: warning: unused variable 'ppn' [-Wunused-variable] 344 | float ppn = (float)nRanks / nNodes; // if ppn < 2, then we are sending/receiving at the same GPU through the NIC, apply some bw discount | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/tuning.cc:11: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:211:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 211 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:222:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 222 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:233:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 233 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:245:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 245 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:258:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 258 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:268:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 268 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:279:13: warning: unused function 'isPow2' [-Wunused-function] 279 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:282:12: warning: unused function 'mirrorBits' [-Wunused-function] 282 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/tuning.cc:626:14: warning: unused variable 'treeCorrectionFactor' [-Wunused-variable] 626 | static float treeCorrectionFactor[NCCL_NUM_PROTOCOLS][23] = { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/tuning.cc:338:10: warning: unused variable 'llMaxBw' [-Wunused-variable] 338 | double llMaxBw = llMaxBws[index1][index2]; | ^~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/tuning.cc:339:10: warning: unused variable 'perChMaxTreeBw' [-Wunused-variable] 339 | double perChMaxTreeBw = perChMaxTreeBws[compCapIndex][index2]; | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/tuning.cc:340:10: warning: unused variable 'perChMaxRingLL128Bw' [-Wunused-variable] 340 | double perChMaxRingLL128Bw = perChMaxRingLL128Bws[compCapIndex][index2]; | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/tuning.cc:341:10: warning: unused variable 'perChMaxTreeLL128Bw' [-Wunused-variable] 341 | double perChMaxTreeLL128Bw = perChMaxTreeLL128Bws[compCapIndex][index2]; | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/tuning.cc:344:9: warning: unused variable 'ppn' [-Wunused-variable] 344 | float ppn = (float)nRanks / nNodes; // if ppn < 2, then we are sending/receiving at the same GPU through the NIC, apply some bw discount | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/tuning.ccIn file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/tuning.cc:11: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:211:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 211 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:222:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 222 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:233:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 233 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:245:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 245 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:258:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 258 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:268:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 268 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:279:13: warning: unused function 'isPow2' [-Wunused-function] 279 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:282:12: warning: unused function 'mirrorBits' [-Wunused-function] 282 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/tuning.cc:626:14: warning: unused variable 'treeCorrectionFactor' [-Wunused-variable] 626 | static float treeCorrectionFactor[NCCL_NUM_PROTOCOLS][23] = { | ^~~~~~~~~~~~~~~~~~~~ :338:10: warning: unused variable 'llMaxBw' [-Wunused-variable] 338 | double llMaxBw = llMaxBws[index1][index2]; | ^~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/tuning.cc:339:10: warning: unused variable 'perChMaxTreeBw' [-Wunused-variable] 339 | double perChMaxTreeBw = perChMaxTreeBws[compCapIndex][index2]; | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/tuning.cc:340:10: warning: unused variable 'perChMaxRingLL128Bw' [-Wunused-variable] 340 | double perChMaxRingLL128Bw = perChMaxRingLL128Bws[compCapIndex][index2]; | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/tuning.cc:341:10: warning: unused variable 'perChMaxTreeLL128Bw' [-Wunused-variable] 341 | double perChMaxTreeLL128Bw = perChMaxTreeLL128Bws[compCapIndex][index2]; | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/tuning.cc:344:9: warning: unused variable 'ppn' [-Wunused-variable] 344 | float ppn = (float)nRanks / nNodes; // if ppn < 2, then we are sending/receiving at the same GPU through the NIC, apply some bw discount | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/tuning.cc:11: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:211:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 211 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:222:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 222 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:233:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 233 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:245:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 245 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:258:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 258 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:268:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 268 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:279:13: warning: unused function 'isPow2' [-Wunused-function] 279 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:282:12: warning: unused function 'mirrorBits' [-Wunused-function] 282 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/tuning.cc:626:14: warning: unused variable 'treeCorrectionFactor' [-Wunused-variable] 626 | static float treeCorrectionFactor[NCCL_NUM_PROTOCOLS][23] = { | ^~~~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/tuning.cc:11: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:211:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 211 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:222:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 222 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:233:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 233 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:245:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 245 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:258:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 258 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:268:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 268 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:279:13: warning: unused function 'isPow2' [-Wunused-function] 279 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:282:12: warning: unused function 'mirrorBits' [-Wunused-function] 282 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/tuning.cc:626:14: warning: unused variable 'treeCorrectionFactor' [-Wunused-variable] 626 | static float treeCorrectionFactor[NCCL_NUM_PROTOCOLS][23] = { | ^~~~~~~~~~~~~~~~~~~~ 15 warnings generated when compiling for gfx1100. 15 warnings generated when compiling for gfx1200. 15 warnings generated when compiling for gfx1101. 15 warnings generated when compiling for gfx1201. 15 warnings generated when compiling for gfx90a. 15 warnings generated when compiling for host. 15 warnings generated when compiling for gfx90a. 15 warnings generated when compiling for gfx1102. [ 49%] Building CXX object CMakeFiles/rccl.dir/hipify/src/misc/argcheck.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60300 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/src/misc/argcheck.cc.o -MF CMakeFiles/rccl.dir/hipify/src/misc/argcheck.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/misc/argcheck.cc.o -c /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/argcheck.cc In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.cc:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.cc:16: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:75:21: warning: unused function 'xmlAlloc' [-Wunused-function] 75 | static ncclResult_t xmlAlloc(struct ncclXml** xml, int maxNodes) { | ^~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:117:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 117 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:124:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 124 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:132:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 132 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:217:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 217 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:281:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 281 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:353:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 353 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:366:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 366 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:104:33: warning: bitwise negation of a boolean expression always evaluates to 'true'; did you mean logical negation? [-Wbool-operation] 104 | if (ret_gpu_id == 0 && ~(ret_unique_id != 0 || ret_loc_id != 0 || ret_unique_id != 0 || ret_vendor != 0) && | ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | ! /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:102:13: warning: unused variable 'ret_domain' [-Wunused-variable] 102 | int ret_domain = read_node_properties(node_id, "domain", &domain, properties); | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:104:33: warning: bitwise negation of a boolean expression always evaluates to 'true'; did you mean logical negation? [-Wbool-operation] 104 | if (ret_gpu_id == 0 && ~(ret_unique_id != 0 || ret_loc_id != 0 || ret_unique_id != 0 || ret_vendor != 0) && | ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | ! /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:102:13: warning: unused variable 'ret_domain' [-Wunused-variable] 102 | int ret_domain = read_node_properties(node_id, "domain", &domain, properties); | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:104:33: warning: bitwise negation of a boolean expression always evaluates to 'true'; did you mean logical negation? [-Wbool-operation] 104 | if (ret_gpu_id == 0 && ~(ret_unique_id != 0 || ret_loc_id != 0 || ret_unique_id != 0 || ret_vendor != 0) && | ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | ! /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:102:13: warning: unused variable 'ret_domain' [-Wunused-variable] 102 | int ret_domain = read_node_properties(node_id, "domain", &domain, properties); | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.cc:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.cc:16: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:75:21: warning: unused function 'xmlAlloc' [-Wunused-function] 75 | static ncclResult_t xmlAlloc(struct ncclXml** xml, int maxNodes) { | ^~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:117:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 117 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:124:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 124 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:132:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 132 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:217:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 217 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:281:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 281 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:353:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 353 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:366:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 366 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:232:22: warning: unused variable 'hops' [-Wunused-variable] 232 | uint64_t hops; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:69:14: warning: unused variable 'count' [-Wunused-variable] 69 | uint32_t count = 0; /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/alt_rsmi.cc| :232:22: warning: unused variable 'hops' [-Wunused-variable] 232 | uint64_t hops; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:69:14: warning: unused variable 'count' [-Wunused-variable] 69 | uint32_t count = 0; | ^~~~~ ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.cc:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.cc:16: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:75:21: warning: unused function 'xmlAlloc' [-Wunused-function] 75 | static ncclResult_t xmlAlloc(struct ncclXml** xml, int maxNodes) { | ^~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:117:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 117 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:124:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 124 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:132:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 132 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:217:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 217 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:281:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 281 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:353:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 353 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:366:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 366 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:232:22: warning: unused variable 'hops' [-Wunused-variable] 232 | uint64_t hops; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:69:14: warning: unused variable 'count' [-Wunused-variable] 69 | uint32_t count = 0; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.cc:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.cc:16: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:75:21: warning: unused function 'xmlAlloc' [-Wunused-function] 75 | static ncclResult_t xmlAlloc(struct ncclXml** xml, int maxNodes) { | ^~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:117:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 117 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:124:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 124 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:132:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 132 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:217:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 217 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:281:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 281 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:353:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 353 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:366:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 366 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.cc:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.cc:16: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:75:21: warning: unused function 'xmlAlloc' [-Wunused-function] 75 | static ncclResult_t xmlAlloc(struct ncclXml** xml, int maxNodes) { | ^~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:117:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 117 | static ncclResult_t xmlGetAttrIntDefault(struct nccIn file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.cc:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.cc:16: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:75:21: warning: unused function 'xmlAlloc' [-Wunused-function] 75 | static ncclResullXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:124:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 124 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:132:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 132 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:217:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 217 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:281:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 281 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:353:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 353 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:366:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 366 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ t_t xmlAlloc(struct ncclXml** xml, int maxNodes) { | ^~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:117:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 117 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:124:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 124 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:132:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 132 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:217:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 217 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:281:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 281 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:353:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 353 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:366:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 366 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.cc:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.cc:16: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:75:21: warning: unused function 'xmlAlloc' [-Wunused-function] 75 | static ncclResult_t xmlAlloc(struct ncclXml** xml, int maxNodes) { | ^~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:117:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 117 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:124:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 124 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:132:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 132 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:217:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 217 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:281:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 281 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:353:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 353 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:366:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 366 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.cc:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.cc:16: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:75:21: warning: unused function 'xmlAlloc' [-Wunused-function] 75 | static ncclResult_t xmlAlloc(struct ncclXml** xml, int maxNodes) { | ^~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:117:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 117 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:124:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 124 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:132:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 132 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:217:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 217 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:281:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 281 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:353:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 353 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:366:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 366 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:104:33: warning: bitwise negation of a boolean expression always evaluates to 'true'; did you mean logical negation? [-Wbool-operation] 104 | if (ret_gpu_id == 0 && ~(ret_unique_id != 0 || ret_loc_id != 0 || ret_unique_id != 0 || ret_vendor != 0) && | ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | ! /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:102:13: warning: unused variable 'ret_domain' [-Wunused-variable] 102 | int ret_domain = read_node_properties(node_id, "domain", &domain, properties); | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:104:33: warning: bitwise negation of a boolean expression always evaluates to 'true'; did you mean logical negation? [-Wbool-operation] 104 | if (ret_gpu_id == 0 && ~(ret_unique_id != 0 || ret_loc_id != 0 || ret_unique_id != 0 || ret_vendor != 0) && | ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | ! /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:102:13: warning: unused variable 'ret_domain' [-Wunused-variable] 102 | int ret_domain = read_node_properties(node_id, "domain", &domain, properties); | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:104:33: warning: bitwise negation of a boolean expression always evaluates to 'true'; did you mean logical negation? [-Wbool-operation] 104 | if (ret_gpu_id == 0 && ~(ret_unique_id != 0 || ret_loc_id != 0 || ret_unique_id != 0 || ret_vendor != 0) && | ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | ! /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:102:13: warning: unused variable 'ret_domain' [-Wunused-variable] 102 | int ret_domain = read_node_properties(node_id, "domain", &domain, properties); | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:104:33: warning: bitwise negation of a boolean expression always evaluates to 'true'; did you mean logical negation? [-Wbool-operation] 104 | if (ret_gpu_id == 0 && ~(ret_unique_id != 0 || ret_loc_id != 0 || ret_unique_id != 0 || ret_vendor != 0) && | ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | ! /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:102:13: warning: unused variable 'ret_domain' [-Wunused-variable] 102 | int ret_domain = read_node_properties(node_id, "domain", &domain, properties); | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:104:33: warning: bitwise negation of a boolean expression always evaluates to 'true'; did you mean logical negation? [-Wbool-operation] 104 | if (ret_gpu_id == 0 && ~(ret_unique_id != 0 || ret_loc_id != 0 || ret_unique_id != 0 || ret_vendor != 0) && | ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | ! /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:102:13: warning: unused variable 'ret_domain' [-Wunused-variable] 102 | int ret_domain = read_node_properties(node_id, "domain", &domain, properties); | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:232:22: warning: unused variable 'hops' [-Wunused-variable] 232 | uint64_t hops; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:69:14: warning: unused variable 'count' [-Wunused-variable] 69 | uint32_t count = 0; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:232:22: warning: unused variable 'hops' [-Wunused-variable] 232 | uint64_t hops; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:69:14: warning: unused variable 'count' [-Wunused-variable] 69 | uint32_t count = 0; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:232:22: warning: unused variable 'hops' [-Wunused-variable] 232 | uint64_t hops; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:69:14: warning: unused variable 'count' [-Wunused-variable] 69 | uint32_t count = 0; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:232:22: warning: unused variable 'hops' [-Wunused-variable] 232 | uint64_t hops; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:69:14: warning: unused variable 'count' [-Wunused-variable] 69 | uint32_t count = 0; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:232:22: warning: unused variable 'hops' [-Wunused-variable] 232 | uint64_t hops; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:69:14: warning: unused variable 'count' [-Wunused-variable] 69 | uint32_t count = 0; | ^~~~~ [ 49%] Building CXX object CMakeFiles/rccl.dir/hipify/src/misc/api_trace.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60300 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/src/misc/api_trace.cc.o -MF CMakeFiles/rccl.dir/hipify/src/misc/api_trace.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/misc/api_trace.cc.o -c /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/api_trace.cc 9 warnings generated when compiling for gfx90a. 9 warnings generated when compiling for gfx90a. 9 warnings generated when compiling for gfx1100. 9 warnings generated when compiling for gfx1102. 9 warnings generated when compiling for gfx1201. 9 warnings generated when compiling for gfx1101. 9 warnings generated when compiling for gfx1200. /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:51:20: warning: unused variable 'kPathDRMRoot' [-Wunused-variable] 51 | static const char *kPathDRMRoot = "/sys/class/drm"; | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:558:13: warning: unused function 'fileExists' [-Wunused-function] 558 | static bool fileExists(char const *filename) | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:51:20: warning: unused variable 'kPathDRMRoot' [-Wunused-variable] 51 | static const char *kPathDRMRoot = "/sys/class/drm"; | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:558:13: warning: unused function 'fileExists' [-Wunused-function] 558 | static bool fileExists(char const *filename) | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:51:20: warning: unused variable 'kPathDRMRoot' [-Wunused-variable] 51 | static const char *kPathDRMRoot = "/sys/class/drm"; | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:558:13: warning: unused function 'fileExists' [-Wunused-function] 558 | static bool fileExists(char const *filename) | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:51:20: warning: unused variable 'kPathDRMRoot' [-Wunused-variable] 51 | static const char *kPathDRMRoot = "/sys/class/drm"; | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:558:13: warning: unused function 'fileExists' [-Wunused-function] 558 | static bool fileExists(char const *filename) | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:51:20: warning: unused variable 'kPathDRMRoot' [-Wunused-variable] 51 | static const char *kPathDRMRoot = "/sys/class/drm"; | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:558:13: warning: unused function 'fileExists' [-Wunused-function] 558 | static bool fileExists(char const *filename) | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:51:20: warning: unused variable 'kPathDRMRoot' [-Wunused-variable] 51 | static const char *kPathDRMRoot = "/sys/class/drm"; | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:558:13: warning: unused function 'fileExists' [-Wunused-function] 558 | static bool fileExists(char const *filename) | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:51:20: warning: unused variable 'kPathDRMRoot' [-Wunused-variable] 51 | static const char *kPathDRMRoot = "/sys/class/drm"; | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:558:13: warning: unused function 'fileExists' [-Wunused-function] 558 | static bool fileExists(char const *filename) | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:51:20: warning: unused variable 'kPathDRMRoot' [-Wunused-variable] 51 | static const char *kPathDRMRoot = "/sys/class/drm"; | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:558:13: warning: unused function 'fileExists' [-Wunused-function] 558 | static bool fileExists(char const *filename) | ^~~~~~~~~~ 6 warnings generated when compiling for gfx90a. 6 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/argcheck.cc:7: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/argcheck.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/argcheck.cc:7: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/argcheck.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/argcheck.cc:7: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/argcheck.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ 6 warnings generated when compiling for gfx1102. 6 warnings generated when compiling for gfx1200. 6 warnings generated when compiling for gfx1101. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/argcheck.cc:7: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/argcheck.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/argcheck.cc:7: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/argcheck.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/argcheck.cc:7: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/argcheck.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ 6 warnings generated when compiling for gfx1201. 6 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/argcheck.cc:7: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/argcheck.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/argcheck.cc:7: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/argcheck.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/argcheck.cc:7: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/argcheck.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/argcheck.cc:7: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/argcheck.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/argcheck.cc:7: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/argcheck.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/argcheck.cc:7: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/argcheck.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/argcheck.cc:7: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/argcheck.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/argcheck.cc:7: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/argcheck.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/argcheck.cc:7: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/argcheck.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/argcheck.cc:7: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/argcheck.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 2 warnings generated when compiling for host. 9 warnings generated when compiling for host. 2 warnings generated when compiling for gfx1201. 2 warnings generated when compiling for gfx90a. 2 warnings generated when compiling for gfx1200. 2 warnings generated when compiling for gfx90a. [ 50%] Building CXX object CMakeFiles/rccl.dir/hipify/src/misc/ibvsymbols.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60300 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/src/misc/ibvsymbols.cc.o -MF CMakeFiles/rccl.dir/hipify/src/misc/ibvsymbols.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/misc/ibvsymbols.cc.o -c /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/ibvsymbols.cc 2 warnings generated when compiling for gfx1100. 2 warnings generated when compiling for gfx1102. 2 warnings generated when compiling for gfx1101. [ 50%] Building CXX object CMakeFiles/rccl.dir/hipify/src/misc/ibvwrap.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60300 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/src/misc/ibvwrap.cc.o -MF CMakeFiles/rccl.dir/hipify/src/misc/ibvwrap.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/misc/ibvwrap.cc.o -c /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/ibvwrap.cc In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/api_trace.cc:3: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/api_trace.cc:3: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/api_trace.cc:3: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/api_trace.cc:3: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/api_trace.cc:3: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/api_trace.cc:3: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/api_trace.cc:3: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/api_trace.cc:3: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx90a. 1 warning generated when compiling for host. 1 warning generated when compiling for gfx1101. 1 warning generated when compiling for gfx1201. 1 warning generated when compiling for gfx1102. 1 warning generated when compiling for gfx90a. 1 warning generated when compiling for gfx1200. 1 warning generated when compiling for gfx1100. [ 50%] Building CXX object CMakeFiles/rccl.dir/hipify/src/misc/ipcsocket.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60300 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/src/misc/ipcsocket.cc.o -MF CMakeFiles/rccl.dir/hipify/src/misc/ipcsocket.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/misc/ipcsocket.cc.o -c /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/ipcsocket.cc In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/ibvsymbols.cc:67: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/ibvsymbols.cc:67: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/ibvsymbols.cc:67: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/ibvsymbols.cc:67: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/ibvsymbols.cc:67: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/ibvsymbols.cc:67: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/ibvsymbols.cc:67: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/ibvwrap.cc:7: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/ibvwrap.h:21: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/ibvsymbols.cc:67: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/ibvwrap.cc:7: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/ibvwrap.h:21: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/ibvwrap.cc:7: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/ibvwrap.h:21: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/ibvwrap.cc:7: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/ibvwrap.h:21: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/ibvwrap.cc:7: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/ibvwrap.h:21: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for host. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/ibvwrap.cc:7: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/ibvwrap.h:21: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/ibvwrap.cc:7: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/ibvwrap.h:21: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/ibvwrap.cc:7: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/ibvwrap.h:21: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx90a. 1 warning generated when compiling for host. 1 warning generated when compiling for gfx1102. 1 warning generated when compiling for gfx1101. 1 warning generated when compiling for gfx1200. 1 warning generated when compiling for gfx1100. 1 warning generated when compiling for gfx90a. 1 warning generated when compiling for gfx1201. 1 warning generated when compiling for gfx90a. 1 warning generated when compiling for gfx1201. 1 warning generated when compiling for gfx1101. 1 warning generated when compiling for gfx90a. 1 warning generated when compiling for gfx1200. 1 warning generated when compiling for gfx1102. 1 warning generated when compiling for gfx1100. [ 50%] Building CXX object CMakeFiles/rccl.dir/hipify/src/misc/npkit.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60300 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/src/misc/npkit.cc.o -MF CMakeFiles/rccl.dir/hipify/src/misc/npkit.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/misc/npkit.cc.o -c /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/npkit.cc [ 51%] Building CXX object CMakeFiles/rccl.dir/hipify/src/misc/nvmlwrap_stub.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60300 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/src/misc/nvmlwrap_stub.cc.o -MF CMakeFiles/rccl.dir/hipify/src/misc/nvmlwrap_stub.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/misc/nvmlwrap_stub.cc.o -c /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/nvmlwrap_stub.cc In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/ipcsocket.cc:8: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/ipcsocket.cc:8: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/ipcsocket.cc:8: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/ipcsocket.cc:8: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/ipcsocket.cc:8: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/ipcsocket.cc:8: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/ipcsocket.cc:8: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/ipcsocket.cc:8: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for host. 6 warnings generated when compiling for host. 1 warning generated when compiling for gfx1200. 1 warning generated when compiling for gfx1102. [ 51%] Building CXX object CMakeFiles/rccl.dir/hipify/src/misc/param.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60300 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/src/misc/param.cc.o -MF CMakeFiles/rccl.dir/hipify/src/misc/param.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/misc/param.cc.o -c /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/param.cc 1 warning generated when compiling for gfx90a. 1 warning generated when compiling for gfx1101. 1 warning generated when compiling for gfx90a. 1 warning generated when compiling for gfx1100. 1 warning generated when compiling for gfx1201. [ 51%] Building CXX object CMakeFiles/rccl.dir/hipify/src/misc/profiler.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60300 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/src/misc/profiler.cc.o -MF CMakeFiles/rccl.dir/hipify/src/misc/profiler.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/misc/profiler.cc.o -c /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/profiler.cc In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/npkit.cc:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/npkit/npkit.h:16: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/npkit.cc:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/npkit/npkit.h:16: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/npkit.cc:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/npkit/npkit.h:16: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/npkit.cc:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/npkit/npkit.h:16: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/npkit.cc:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/npkit/npkit.h:16: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/npkit.cc:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/npkit/npkit.h:16: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/npkit.cc:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/npkit/npkit.h:16: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/npkit.cc:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/npkit/npkit.h:16: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ [ 51%] Building CXX object CMakeFiles/rccl.dir/hipify/src/misc/rocm_smi_wrap.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60300 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/src/misc/rocm_smi_wrap.cc.o -MF CMakeFiles/rccl.dir/hipify/src/misc/rocm_smi_wrap.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/misc/rocm_smi_wrap.cc.o -c /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/rocm_smi_wrap.cc In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/npkit.cc:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/npkit.cc:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/npkit.cc:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/npkit.cc:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/npkit.cc:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/npkit.cc:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/npkit.cc:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/npkit.cc:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 22 warnings generated when compiling for gfx1102. warnings generated when compiling for gfx1200. 2 warnings generated when compiling for gfx1100. 2 warnings generated when compiling for gfx90a. 2 warnings generated when compiling for gfx1201. 2 warnings generated when compiling for gfx1101. 2 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/profiler.cc:7: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/profiler.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/proxy.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/profiler.cc:7: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/profiler.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/proxy.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/profiler.cc:7: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/profiler.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/proxy.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/profiler.cc:7: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/profiler.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/proxy.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/profiler.cc:7: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/profiler.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/proxy.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/profiler.cc:7: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/profiler.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/proxy.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/profiler.cc:7: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/profiler.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/proxy.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/profiler.cc:7: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/profiler.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/proxy.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/profiler.cc:7: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/profiler.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/proxy.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/profiler.cc:7: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/profiler.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/proxy.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/profiler.cc:7: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/profiler.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/proxy.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/profiler.cc:7: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/profiler.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/proxy.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 2 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/profiler.cc:7: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/profiler.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/proxy.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/profiler.cc:7: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/profiler.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/proxy.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/profiler.cc:7: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/profiler.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/proxy.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/profiler.cc:7: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/profiler.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/proxy.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ [ 52%] Building CXX object CMakeFiles/rccl.dir/hipify/src/misc/rocmwrap.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60300 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/src/misc/rocmwrap.cc.o -MF CMakeFiles/rccl.dir/hipify/src/misc/rocmwrap.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/misc/rocmwrap.cc.o -c /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/rocmwrap.cc 2 warnings generated when compiling for gfx1100. 2 warnings generated when compiling for gfx1102. 2 warnings generated when compiling for gfx90a. 2 warnings generated when compiling for gfx1101. 2 warnings generated when compiling for host. 2 warnings generated when compiling for gfx1200. 2 warnings generated when compiling for gfx90a. 2 warnings generated when compiling for gfx1201. [ 52%] Building CXX object CMakeFiles/rccl.dir/hipify/src/misc/roctx.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60300 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/src/misc/roctx.cc.o -MF CMakeFiles/rccl.dir/hipify/src/misc/roctx.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/misc/roctx.cc.o -c /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/roctx.cc [ 52%] Building CXX object CMakeFiles/rccl.dir/hipify/src/misc/shmutils.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60300 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/src/misc/shmutils.cc.o -MF CMakeFiles/rccl.dir/hipify/src/misc/shmutils.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/misc/shmutils.cc.o -c /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/shmutils.cc In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/rocm_smi_wrap.cc:24: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/rocm_smi_wrap.cc:24: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/rocm_smi_wrap.cc:24: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/rocm_smi_wrap.cc:24: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/rocm_smi_wrap.cc:24: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/rocm_smi_wrap.cc:24: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/rocm_smi_wrap.cc:24: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/rocm_smi_wrap.cc:24: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for host. 1 warning generated when compiling for gfx1201. 1 warning generated when compiling for gfx1100. 1 warning generated when compiling for gfx1200. 1 warning generated when compiling for gfx90a. 1 warning generated when compiling for gfx1101. 1 warning generated when compiling for gfx1102. 1 warning generated when compiling for gfx90a. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/roctx.cc:7: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/roctx.h:17: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/nvtx3/nvtx3.hpp:1940:9: warning: suggest braces around initialization of subobject [-Wmissing-braces] 1940 | 0, // payload value (union) | ^ | {} /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/nvtx3/nvtx3.hpp:1942:9: warning: suggest braces around initialization of subobject [-Wmissing-braces] 1942 | 0 // message value (union) | ^ | {} In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/roctx.cc:7: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/roctx.h:17: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/nvtx3/nvtx3.hpp:1940:9: warning: suggest braces around initialization of subobject [-Wmissing-braces] 1940 | 0, // payload value (union) | ^ | {} /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/nvtx3/nvtx3.hpp:1942:9: warning: suggest braces around initialization of subobject [-Wmissing-braces] 1942 | 0 // message value (union) | ^ | {} [ 52%] Building CXX object CMakeFiles/rccl.dir/hipify/src/misc/signals.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60300 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/src/misc/signals.cc.o -MF CMakeFiles/rccl.dir/hipify/src/misc/signals.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/misc/signals.cc.o -c /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/signals.cc In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/roctx.cc:7: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/roctx.h:17: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/nvtx3/nvtx3.hpp:1940:9: warning: suggest braces around initialization of subobject [-Wmissing-braces] 1940 | 0, // payload value (union) | ^ | {} /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/nvtx3/nvtx3.hpp:1942:9: warning: suggest braces around initialization of subobject [-Wmissing-braces] 1942 | 0 // message value (union) | ^ | {} In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/roctx.cc:7: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/roctx.h:17: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/nvtx3/nvtx3.hpp:1940:9: warning: suggest braces around initialization of subobject [-Wmissing-braces] 1940 | 0, // payload value (union) | ^ | {} /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/nvtx3/nvtx3.hpp:1942:9: warning: suggest braces around initialization of subobject [-Wmissing-braces] 1942 | 0 // message value (union) | ^ | {} /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/roctx.cc:7: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/roctx.h:17: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/nvtx3/nvtx3.hpp:1940:9: warning: suggest braces around initialization of subobject [-Wmissing-braces] 1940 | 0, // payload value (union) | ^ | {} /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/nvtx3/nvtx3.hpp:1942:9: warning: suggest braces around initialization of subobject [-Wmissing-braces] 1942 | 0 // message value (union) | ^ | {} In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/roctx.cc:7: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/roctx.h:17: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/nvtx3/nvtx3.hpp:1940:9: warning: suggest braces around initialization of subobject [-Wmissing-braces] 1940 | 0, // payload value (union) | ^ | {} /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/nvtx3/nvtx3.hpp:1942:9: warning: suggest braces around initialization of subobject [-Wmissing-braces] 1942 | 0 // message value (union) | ^ | {} In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/roctx.cc:7: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/roctx.h:17: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/nvtx3/nvtx3.hpp:1940:9: warning: suggest braces around initialization of subobject [-Wmissing-braces] 1940 | 0, // payload value (union) | ^ | {} /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/nvtx3/nvtx3.hpp:1942:9: warning: suggest braces around initialization of subobject [-Wmissing-braces] 1942 | 0 // message value (union) | ^ | {} In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/roctx.cc:7: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/roctx.h:17: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/nvtx3/nvtx3.hpp:1940:9: warning: suggest braces around initialization of subobject [-Wmissing-braces] 1940 | 0, // payload value (union) | ^ | {} /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/nvtx3/nvtx3.hpp:1942:9: warning: suggest braces around initialization of subobject [-Wmissing-braces] 1942 | 0 // message value (union) | ^ | {} In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/shmutils.cc:8: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/shmutils.cc:8: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/shmutils.cc:8: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/shmutils.cc:8: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/shmutils.cc:8: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/shmutils.cc:8: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/shmutils.cc:8: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/shmutils.cc:8: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/roctx.cc:7: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/roctx.h:18: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/roctx.cc:7: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/roctx.h:18: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/roctx.cc:7: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/roctx.h:18: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/roctx.cc:7: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/roctx.h:18: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/roctx.cc:7: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/roctx.h:18: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/roctx.cc:7: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/roctx.h:18: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/roctx.cc:7: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/roctx.h:18: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/roctx.cc:7: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/roctx.h:18: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/shmutils.cc:8: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/shmutils.cc:8: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/shmutils.cc:8: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ [ 53%] Building CXX object CMakeFiles/rccl.dir/hipify/src/misc/socket.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60300 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/src/misc/socket.cc.o -MF CMakeFiles/rccl.dir/hipify/src/misc/socket.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/misc/socket.cc.o -c /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/socket.cc In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/shmutils.cc:8: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/shmutils.cc:8: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/shmutils.cc:8: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/shmutils.cc:8: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/shmutils.cc:8: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 2 warnings generated when compiling for host. 2 warnings generated when compiling for gfx1200. 3 warnings generated when compiling for gfx90a. 2 warnings generated when compiling for gfx90a. 2 warnings generated when compiling for gfx90a. 2 warnings generated when compiling for gfx1201. 2 warnings generated when compiling for gfx1100. 2 warnings generated when compiling for gfx1102. 2 warnings generated when compiling for gfx1101. 3 warnings generated when compiling for gfx1200. 3 warnings generated when compiling for gfx1101. 3 warnings generated when compiling for gfx90a. 3 warnings generated when compiling for host. 3 warnings generated when compiling for gfx1102. 3 warnings generated when compiling for gfx1201. [ 53%] Building CXX object CMakeFiles/rccl.dir/hipify/src/misc/strongstream.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60300 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/src/misc/strongstream.cc.o -MF CMakeFiles/rccl.dir/hipify/src/misc/strongstream.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/misc/strongstream.cc.o -c /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/strongstream.cc 3 warnings generated when compiling for gfx1100. [ 53%] Building CXX object CMakeFiles/rccl.dir/hipify/src/misc/tuner.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60300 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/src/misc/tuner.cc.o -MF CMakeFiles/rccl.dir/hipify/src/misc/tuner.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/misc/tuner.cc.o -c /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/tuner.cc [ 53%] Building CXX object CMakeFiles/rccl.dir/hipify/src/misc/utils.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60300 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/src/misc/utils.cc.o -MF CMakeFiles/rccl.dir/hipify/src/misc/utils.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/misc/utils.cc.o -c /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/utils.cc /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/socket.cc:598:8: warning: unused variable 'line' [-Wunused-variable] 598 | char line[SOCKET_NAME_MAXLEN+1]; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/socket.cc:598:8: warning: unused variable 'line' [-Wunused-variable] 598 | char line[SOCKET_NAME_MAXLEN+1]; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/socket.cc:598:8: warning: unused variable 'line' [-Wunused-variable] 598 | char line[SOCKET_NAME_MAXLEN+1]; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/socket.cc:598:8: warning: unused variable 'line' [-Wunused-variable] 598 | char line[SOCKET_NAME_MAXLEN+1]; | ^~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/socket.cc:8: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/socket.cc:598:8: warning: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/socket.cc:598:8: warning: unused variable 'line' [-Wunused-variable] 598 | char line[SOCKET_NAME_MAXLEN+1]; | ^~~~ unused variable 'line' [-Wunused-variable] 598 | char line[SOCKET_NAME_MAXLEN+1]; | ^~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/socket.cc:8: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/socket.cc:8: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/socket.cc:598:8: warning: unused variable 'line' [-Wunused-variable] 598 | char line[SOCKET_NAME_MAXLEN+1]; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/socket.cc:598:8: warning: unused variable 'line' [-Wunused-variable] 598 | char line[SOCKET_NAME_MAXLEN+1]; | ^~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/socket.cc:8: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/socket.cc:8: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/socket.cc:8: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/socket.cc:8: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/socket.cc:8: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 2 warnings generated when compiling for gfx90a. 2 warnings generated when compiling for gfx1101. 2 warnings generated when compiling for gfx90a. 2 warnings generated when compiling for gfx1200. 2 warnings generated when compiling for gfx1100. 2 warnings generated when compiling for gfx1102. 2 warnings generated when compiling for gfx1201. 2 warnings generated when compiling for host. [ 54%] Building CXX object CMakeFiles/rccl.dir/hipify/src/misc/msccl/msccl_lifecycle.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60300 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/src/misc/msccl/msccl_lifecycle.cc.o -MF CMakeFiles/rccl.dir/hipify/src/misc/msccl/msccl_lifecycle.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/misc/msccl/msccl_lifecycle.cc.o -c /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc [ 54%] Building CXX object CMakeFiles/rccl.dir/hipify/src/misc/msccl/msccl_parser.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60300 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/src/misc/msccl/msccl_parser.cc.o -MF CMakeFiles/rccl.dir/hipify/src/misc/msccl/msccl_parser.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/misc/msccl/msccl_parser.cc.o -c /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/msccl/msccl_parser.cc [ 54%] Building CXX object CMakeFiles/rccl.dir/hipify/src/misc/msccl/msccl_setup.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60300 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/src/misc/msccl/msccl_setup.cc.o -MF CMakeFiles/rccl.dir/hipify/src/misc/msccl/msccl_setup.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/misc/msccl/msccl_setup.cc.o -c /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/msccl/msccl_setup.cc In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/utils.cc:7: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/utils.cc:7: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/utils.cc:7: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/utils.cc:7: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/utils.cc:7: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/utils.cc:7: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/utils.cc:7: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/utils.cc:7: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for host. 1 warning generated when compiling for gfx90a. 1 warning generated when compiling for gfx1201. 11 warning generated when compiling for gfx1100. warning generated when compiling for gfx1102. 1 warning generated when compiling for gfx1101. 1 warning generated when compiling for gfx90a. 1 warning generated when compiling for gfx1200. [ 54%] Building CXX object CMakeFiles/rccl.dir/hipify/src/misc/msccl/msccl_status.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60300 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/src/misc/msccl/msccl_status.cc.o -MF CMakeFiles/rccl.dir/hipify/src/misc/msccl/msccl_status.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/misc/msccl/msccl_status.cc.o -c /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/msccl/msccl_status.cc In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/msccl/msccl_setup.cc:6: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/msccl/msccl_setup.cc:6: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/msccl/msccl_setup.cc:6: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/msccl/msccl_setup.cc:6: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/msccl/msccl_setup.cc:6: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/msccl/msccl_setup.cc:6: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/msccl/msccl_setup.cc:6: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/msccl/msccl_setup.cc:6: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:18: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:18: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:18: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:18: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.hIn file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:18: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ :76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/msccl/msccl_parser.cc:16: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/msccl/msccl_struct.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:18: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/msccl/msccl_parser.cc:16: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/msccl/msccl_struct.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/msccl/msccl_parser.cc:16: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/msccl/msccl_struct.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/msccl/msccl_parser.cc:16: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/msccl/msccl_struct.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/msccl/msccl_parser.cc:16: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/msccl/msccl_struct.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/msccl/msccl_parser.cc:16: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/msccl/msccl_struct.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:18: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:18: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/msccl/msccl_parser.cc:16: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/msccl/msccl_struct.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/msccl/msccl_parser.cc:16: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/msccl/msccl_struct.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/msccl/msccl_parser.cc:711:16: warning: unused variable 'ret' [-Wunused-variable] 711 | ncclResult_t ret = ncclSuccess; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/msccl/msccl_parser.cc:711:16: warning: unused variable 'ret' [-Wunused-variable] 711 | ncclResult_t ret = ncclSuccess; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/msccl/msccl_parser.cc/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/msccl/msccl_parser.cc:723:16: warning: unused variable 'ret' [-Wunused-variable] 723 | ncclResult_t ret = ncclSuccess; | ^~~ :723:16: warning: unused variable 'ret' [-Wunused-variable] 723 | ncclResult_t ret = ncclSuccess; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/msccl/msccl_parser.cc:711:16: warning: unused variable 'ret' [-Wunused-variable] 711 | ncclResult_t ret = ncclSuccess; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/msccl/msccl_parser.cc:711:16: warning: unused variable 'ret' [-Wunused-variable] 711 | ncclResult_t ret = ncclSuccess; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/msccl/msccl_parser.cc:723:16: warning: unused variable 'ret' [-Wunused-variable] 723 | ncclResult_t ret = ncclSuccess; /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/msccl/msccl_parser.cc:711:16: warning: unused variable 'ret' [-Wunused-variable] 711 | ncclResult_t ret = ncclSuccess; | ^~~ | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/msccl/msccl_parser.cc:723:16: warning: unused variable 'ret' [-Wunused-variable] 723 | ncclResult_t ret = ncclSuccess; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/msccl/msccl_parser.cc:711:16: warning: unused variable 'ret' [-Wunused-variable] 711 | ncclResult_t ret = ncclSuccess; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/msccl/msccl_parser.cc:723:16: warning: unused variable 'ret' [-Wunused-variable] 723 | ncclResult_t ret = ncclSuccess; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/msccl/msccl_parser.cc:723:16: warning: unused variable 'ret' [-Wunused-variable] 723 | ncclResult_t ret = ncclSuccess; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/msccl/msccl_parser.cc:711:16: warning: unused variable 'ret' [-Wunused-variable] 711 | ncclResult_t ret = ncclSuccess; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/msccl/msccl_parser.cc:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/msccl/msccl_parser.cc/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/msccl/msccl_parser.cc:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ :711:16: warning: unused variable 'ret' [-Wunused-variable] 711 | ncclResult_t ret = ncclSuccess; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/msccl/msccl_parser.cc:723:16: warning: unused variable 'ret' [-Wunused-variable] 723 | ncclResult_t ret = ncclSuccess; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/msccl/msccl_parser.cc:723:16: warning: unused variable 'ret' [-Wunused-variable] 723 | ncclResult_t ret = ncclSuccess; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/msccl/msccl_setup.cc/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/msccl/msccl_setup.cc:127:27: warning: unused variable 'threadLocalStatus' [-Wunused-variable] 127 | mscclThreadLocalStatus& threadLocalStatus = mscclGetThreadLocalStatus(); | ^~~~~~~~~~~~~~~~~ :127:27: warning: unused variable 'threadLocalStatus' [-Wunused-variable] 127 | mscclThreadLocalStatus& threadLocalStatus = mscclGetThreadLocalStatus(); | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/msccl/msccl_setup.cc:127:27: warning: unused variable 'threadLocalStatus' [-Wunused-variable] 127 | mscclThreadLocalStatus& threadLocalStatus = mscclGetThreadLocalStatus(); | ^~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/msccl/msccl_parser.cc:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/msccl/msccl_setup.cc:127:27: warning: unused variable 'threadLocalStatus' [-Wunused-variable] 127 | mscclThreadLocalStatus& threadLocalStatus = mscclGetThreadLocalStatus(); | ^~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/msccl/msccl_setup.cc/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/msccl/msccl_setup.cc:127:27: warning: unused variable 'threadLocalStatus' [-Wunused-variable] 127 | mscclThreadLocalStatus& threadLocalStatus = mscclGetThreadLocalStatus(); | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/msccl/msccl_parser.cc:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ :127:27: warning: unused variable 'threadLocalStatus' [-Wunused-variable] 127 | mscclThreadLocalStatus& threadLocalStatus = mscclGetThreadLocalStatus(); | ^~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/msccl/msccl_parser.cc:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/msccl/msccl_parser.cc:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/msccl/msccl_parser.cc:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/msccl/msccl_parser.cc:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:512:10: warning: unused variable 'nBytes' [-Wunused-variable] 512 | size_t nBytes = count * ncclTypeSize(dataType); | ^~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:512:10: warning: unused variable 'nBytes' [-Wunused-variable] 512 | size_t nBytes = count * ncclTypeSize(dataType); | ^~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/msccl/msccl_setup.cc:127:27: warning: unused variable 'threadLocalStatus' [-Wunused-variable] :512:10: warning: unused variable 'nBytes' [-Wunused-variable] 512 | size_t nBytes = count * ncclTypeSize(dataType); | ^~~~~~ 127 | mscclThreadLocalStatus& threadLocalStatus = mscclGetThreadLocalStatus(); | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:512:10: warning: unused variable 'nBytes' [-Wunused-variable] 512 | size_t nBytes = count * ncclTypeSize(dataType); | ^~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/msccl/msccl_setup.cc:127:27: warning: unused variable 'threadLocalStatus' [-Wunused-variable] 127 | mscclThreadLocalStatus& threadLocalStatus = mscclGetThreadLocalStatus(); | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:512:10: warning: unused variable 'nBytes' [-Wunused-variable] 512 | size_t nBytes = count * ncclTypeSize(dataType); | ^~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:512:10: warning: unused variable 'nBytes' [-Wunused-variable] 512 | size_t nBytes = count */builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:512:10: warning: unused variable 'nBytes' [-Wunused-variable] 512 | siz ncclTypeSize(dataType); | ^~~~~~ e_t nBytes = count * ncclTypeSize(dataType); | ^~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:512:10: warning: unused variable 'nBytes' [-Wunused-variable] 512 | size_t nBytes = count * ncclTypeSize(dataType); | ^~~~~~ 4 warnings generated when compiling for gfx1201. 4 warnings generated when compiling for gfx1200. 4 warnings generated when compiling for gfx1100. 44 warnings generated when compiling for gfx90a. warnings generated when compiling for gfx1101. 4 warnings generated when compiling for gfx90a. 4 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/msccl/msccl_setup.cc:6: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/msccl/msccl_setup.cc:6: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/msccl/msccl_setup.cc:6: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/msccl/msccl_setup.cc:6: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/msccl/msccl_setup.cc:6: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/msccl/msccl_setup.cc:6: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/msccl/msccl_setup.cc:6: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/msccl/msccl_setup.cc:6: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/msccl/msccl_setup.cc:6: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/msccl/msccl_setup.cc:6: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/msccl/msccl_setup.cc:6: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/msccl/msccl_setup.cc:6: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/msccl/msccl_setup.cc:6: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/msccl/msccl_setup.cc:6: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/msccl/msccl_setup.cc:6: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/msccl/msccl_setup.cc:6: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ 4 warnings generated when compiling for gfx1100. 4 warnings generated when compiling for gfx1102. 4 warnings generated when compiling for gfx90a. 4 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:16: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:18: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:211:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 211 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:222:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 222 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:233:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 233 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:245:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 245 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:258:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 258 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:268:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 268 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:279:13: warning: unused function 'isPow2' [-Wunused-function] 279 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:282:12: warning: unused function 'mirrorBits' [-Wunused-function] 282 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:21: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:75:21: warning: unused function 'mscclXmlGetAttrInt' [-Wunused-function] 75 | static ncclResult_t mscclXmlGetAttrInt(struct mscclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:82:21: warning: unused function 'mscclXmlGetAttrInt64' [-Wunused-function] 82 | static ncclResult_t mscclXmlGetAttrInt64(struct mscclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:89:21: warning: unused function 'mscclXmlFindTag' [-Wunused-function] 89 | static ncclResult_t mscclXmlFindTag(struct mscclXml* xml, const char* tagName, struct mscclXmlNode** node) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:32:20: warning: unused variable 'mscclAlgoFilePathEnv' [-Wunused-variable] 32 | static const char* mscclAlgoFilePathEnv = "MSCCL_ALGO_FILE_PATH"; | ^~~~~~~~~~~~~~~~~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:16: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:18: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:211:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 211 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:222:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 222 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:233:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 233 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:245:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 245 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:258:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 258 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:268:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 268 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:279:13: warning: unused function 'isPow2' [-Wunused-function] 279 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:282:12: warning: unused function 'mirrorBits' [-Wunused-function] 282 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:21: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:75:21: warning: unused function 'mscclXmlGetAttrInt' [-Wunused-function] 75 | static ncclResult_t mscclXmlGetAttrInt(struct mscclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:82:21: warning: unused function 'mscclXmlGetAttrInt64' [-Wunused-function] 82 | static ncclResult_t mscclXmlGetAttrInt64(struct mscclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:89:21: warning: unused function 'mscclXmlFindTag' [-Wunused-function] 89 | static ncclResult_t mscclXmlFindTag(struct mscclXml* xml, const char* tagName, struct mscclXmlNode** node) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:32:20: warning: unused variable 'mscclAlgoFilePathEnv' [-Wunused-variable] 32 | static const char* mscclAlgoFilePathEnv = "MSCCL_ALGO_FILE_PATH"; | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:16: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:18: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:211:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 211 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:222:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 222 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:233:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 233 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:245:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 245 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:258:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 258 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:268:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 268 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:279:13: warning: unused function 'isPow2' [-Wunused-function] 279 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:282:12: warning: unused function 'mirrorBits' [-Wunused-function] 282 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:21: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:75:21: warning: unused function 'mscclXmlGetAttrInt' [-Wunused-function] 75 | static ncclResult_t mscclXmlGetAttrInt(struct mscclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:82:21: warning: unused function 'mscclXmlGetAttrInt64' [-Wunused-function] 82 | static ncclResult_t mscclXmlGetAttrInt64(struct mscclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:89:21: warning: unused function 'mscclXmlFindTag' [-Wunused-function] 89 | static ncclResult_t mscclXmlFindTag(struct mscclXml* xml, const char* tagName, struct mscclXmlNode** node) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:32:20: warning: unused variable 'mscclAlgoFilePathEnv' [-Wunused-variable] 32 | static const char* mscclAlgoFilePathEnv = "MSCCL_ALGO_FILE_PATH"; | ^~~~~~~~~~~~~~~~~~~~ 4 warnings generated when compiling for gfx1101. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:16: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:18: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:211:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 211 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:222:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 222 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:233:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 233 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:245:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 245 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:258:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 258 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:268:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 268 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:279:13: warning: unused function 'isPow2' [-Wunused-function] 279 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:282:12: warning: unused function 'mirrorBits' [-Wunused-function] 282 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:21: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:75:21: warning: unused function 'mscclXmlGetAttrInt' [-Wunused-function] 75 | static ncclResult_t mscclXmlGetAttrInt(struct mscclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:82:21: warning: unused function 'mscclXmlGetAttrInt64' [-Wunused-function] 82 | static ncclResult_t mscclXmlGetAttrInt64(struct mscclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:89:21: warning: unused function 'mscclXmlFindTag' [-Wunused-function] 89 | static ncclResult_t mscclXmlFindTag(struct mscclXml* xml, const char* tagName, struct mscclXmlNode** node) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:32:20: warning: unused variable 'mscclAlgoFilePathEnv' [-Wunused-variable] 32 | static const char* mscclAlgoFilePathEnv = "MSCCL_ALGO_FILE_PATH"; | ^~~~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:16: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:18: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:211:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 211 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:222:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 222 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:233:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 233 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:245:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 245 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:258:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 258 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:268:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 268 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:279:13: warning: unused function 'isPow2' [-Wunused-function] 279 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:282:12: warning: unused function 'mirrorBits' [-Wunused-function] 282 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:21: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:75:21: warning: unused function 'mscclXmlGetAttrInt' [-Wunused-function] 75 | static ncclResult_t mscclXmlGetAttrInt(struct mscclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:82:21: warning: unused function 'mscclXmlGetAttrInt64' [-Wunused-function] 82 | static ncclResult_t mscclXmlGetAttrInt64(struct mscclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:89:21: warning: unused function 'mscclXmlFindTag' [-Wunused-function] 89 | static ncclResult_t mscclXmlFindTag(struct mscclXml* xml, const char* tagName, struct mscclXmlNode** node) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:32:20: warning: unused variable 'mscclAlgoFilePathEnv' [-Wunused-variable] 32 | static const char* mscclAlgoFilePathEnv = "MSCCL_ALGO_FILE_PATH"; | ^~~~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:16: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:18: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:211:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 211 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:222:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 222 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:233:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 233 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:245:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 245 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:258:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 258 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:268:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 268 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:279:13: warning: unused function 'isPow2' [-Wunused-function] 279 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:282:12: warning: unused function 'mirrorBits' [-Wunused-function] 282 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:21: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:75:21: warning: unused function 'mscclXmlGetAttrInt' [-Wunused-function] 75 | static ncclResult_t mscclXmlGetAttrInt(struct mscclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:82:21: warning: unused function 'mscclXmlGetAttrInt64' [-Wunused-function] 82 | static ncclResult_t mscclXmlGetAttrInt64(struct mscclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:89:21: warning: unused function 'mscclXmlFindTag' [-Wunused-function] 89 | static ncclResult_t mscclXmlFindTag(struct mscclXml* xml, const char* tagName, struct mscclXmlNode** node) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:32:20: warning: unused variable 'mscclAlgoFilePathEnv' [-Wunused-variable] 32 | static const char* mscclAlgoFilePathEnv = "MSCCL_ALGO_FILE_PATH"; | ^~~~~~~~~~~~~~~~~~~~ 4 warnings generated when compiling for gfx1201. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:16: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:18: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:211:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 211 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:222:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 222 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:233:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 233 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:245:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 245 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:258:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 258 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:268:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 268 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:279:13: warning: unused function 'isPow2' [-Wunused-function] 279 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:282:12: warning: unused function 'mirrorBits' [-Wunused-function] 282 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:21: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:75:21: warning: unused function 'mscclXmlGetAttrInt' [-Wunused-function] 75 | static ncclResult_t mscclXmlGetAttrInt(struct mscclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:82:21: warning: unused function 'mscclXmlGetAttrInt64' [-Wunused-function] 82 | static ncclResult_t mscclXmlGetAttrInt64(struct mscclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:89:21: warning: unused function 'mscclXmlFindTag' [-Wunused-function] 89 | static ncclResult_t mscclXmlFindTag(struct mscclXml* xml, const char* tagName, struct mscclXmlNode** node) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:32:20: warning: unused variable 'mscclAlgoFilePathEnv' [-Wunused-variable] 32 | static const char* mscclAlgoFilePathEnv = "MSCCL_ALGO_FILE_PATH"; | ^~~~~~~~~~~~~~~~~~~~ 4 warnings generated when compiling for gfx1200. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:16: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:18: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:211:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 211 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:222:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 222 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:233:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 233 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:245:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 245 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:258:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 258 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:268:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 268 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:279:13: warning: unused function 'isPow2' [-Wunused-function] 279 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:282:12: warning: unused function 'mirrorBits' [-Wunused-function] 282 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:21: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:75:21: warning: unused function 'mscclXmlGetAttrInt' [-Wunused-function] 75 | static ncclResult_t mscclXmlGetAttrInt(struct mscclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:82:21: warning: unused function 'mscclXmlGetAttrInt64' [-Wunused-function] 82 | static ncclResult_t mscclXmlGetAttrInt64(struct mscclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:89:21: warning: unused function 'mscclXmlFindTag' [-Wunused-function] 89 | static ncclResult_t mscclXmlFindTag(struct mscclXml* xml, const char* tagName, struct mscclXmlNode** node) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:32:20: warning: unused variable 'mscclAlgoFilePathEnv' [-Wunused-variable] 32 | static const char* mscclAlgoFilePathEnv = "MSCCL_ALGO_FILE_PATH"; | ^~~~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/msccl/msccl_status.cc:6: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/msccl/msccl_status.h:9: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/msccl/msccl_struct.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/msccl/msccl_status.cc:6: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/msccl/msccl_status.h:9: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/msccl/msccl_struct.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/msccl/msccl_status.cc:6: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/msccl/msccl_status.h:9: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/msccl/msccl_struct.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/msccl/msccl_status.cc:6: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/msccl/msccl_status.h:9: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/msccl/msccl_struct.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/msccl/msccl_status.cc:6: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/msccl/msccl_status.h:9: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/msccl/msccl_struct.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/msccl/msccl_status.cc:6: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/msccl/msccl_status.h:9: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/msccl/msccl_struct.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/msccl/msccl_status.cc:6: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/msccl/msccl_status.h:9: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/msccl/msccl_struct.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/misc/msccl/msccl_status.cc:6: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/msccl/msccl_status.h:9: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/msccl/msccl_struct.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ 4 warnings generated when compiling for host. 15 warnings generated when compiling for gfx1200. 15 warnings generated when compiling for gfx1102. 15 warnings generated when compiling for gfx90a. 15 warnings generated when compiling for gfx1201. 15 warnings generated when compiling for gfx1101. 15 warnings generated when compiling for gfx90a. 15 warnings generated when compiling for gfx1100. [ 55%] Building CXX object CMakeFiles/rccl.dir/hipify/src/transport/coll_net.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60300 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/src/transport/coll_net.cc.o -MF CMakeFiles/rccl.dir/hipify/src/transport/coll_net.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/transport/coll_net.cc.o -c /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/coll_net.cc 4 warnings generated when compiling for host. [ 55%] Building CXX object CMakeFiles/rccl.dir/hipify/src/transport/net_tmp.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60300 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/src/transport/net_tmp.cc.o -MF CMakeFiles/rccl.dir/hipify/src/transport/net_tmp.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/transport/net_tmp.cc.o -c /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/net_tmp.cc 1 warning generated when compiling for gfx1201. 1 warning generated when compiling for gfx90a. 1 warning generated when compiling for gfx1100. 1 warning generated when compiling for gfx1101. 1 warning generated when compiling for gfx90a. 1 warning generated when compiling for gfx1200. 1 warning generated when compiling for gfx1102. 1 warning generated when compiling for host. [ 55%] Building CXX object CMakeFiles/rccl.dir/hipify/src/transport/net_ib.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60300 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/src/transport/net_ib.cc.o -MF CMakeFiles/rccl.dir/hipify/src/transport/net_ib.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/transport/net_ib.cc.o -c /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/net_ib.cc In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/coll_net.cc:8: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/coll_net.cc:8: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/coll_net.cc:8: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/coll_net.cc:8: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/coll_net.cc:8: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/coll_net.cc:8: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/coll_net.cc:8: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/coll_net.cc:8: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/net_tmp.cc:9: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/net_tmp.cc:9: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/net_tmp.cc:9: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/net_tmp.cc:9: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/net_tmp.cc:9: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/net_tmp.cc:9: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/net_tmp.cc:9: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ 15 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/net_tmp.cc:9: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ [ 55%] Building CXX object CMakeFiles/rccl.dir/hipify/src/transport/net_socket.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60300 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/src/transport/net_socket.cc.o -MF CMakeFiles/rccl.dir/hipify/src/transport/net_socket.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/transport/net_socket.cc.o -c /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/net_socket.cc In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/coll_net.cc:12: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/gdrwrap.h:204:19: warning: unused variable 'md' [-Wunused-variable] 204 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/coll_net.cc:12: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/gdrwrap.h:204:19: warning: unused variable 'md' [-Wunused-variable] 204 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/coll_net.cc:12: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/gdrwrap.h:204:19: warning: unused variable 'md' [-Wunused-variable] 204 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/coll_net.cc:12: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/gdrwrap.h:204:19: warning: unused variable 'md' [-Wunused-variable] 204 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/coll_net.cc:12: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/gdrwrap.h:204:19: warning: unused variable 'md' [-Wunused-variable] 204 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/coll_net.cc:8: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/coll_net.cc:9: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:33:12: warning: unused function 'collNetSupport' [-Wunused-function] 33 | static int collNetSupport(struct ncclComm* comm) { return comm->ncclCollNet != nullptr ? 1 : 0; } | ^~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/coll_net.cc:12: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/coll_net.cc:199:21: warning: unused function 'collNetDumpMap' [-Wunused-function] 199 | static ncclResult_t collNetDumpMap(struct connectMap* map) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/coll_net.cc:402:21: warning: unused function 'sharedBuffersGet' [-Wunused-function] 402 | static ncclResult_t sharedBuffersGet(struct ncclCollNetSharedRes* collNet, int type, int slot, int channel, int* offset) { | ^~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/coll_net.cc:12: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/gdrwrap.h:204:19: warning: unused variable 'md' [-Wunused-variable] 204 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/coll_net.cc:8: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/coll_net.cc:9: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/coll_net.cc:12: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/gdrwrap.h:204:19: warning: unused variable 'md' [-Wunused-variable] 204 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:33:12: warning: unused function 'collNetSupport' [-Wunused-function] 33 | static int collNetSupport(struct ncclComm* comm) { return comm->ncclCollNet != nullptr ? 1 : 0; } | ^~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/coll_net.cc:12: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/coll_net.cc:199:21: warning: unused function 'collNetDumpMap' [-Wunused-function] 199 | static ncclResult_t collNetDumpMap(struct connectMap* map) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/coll_net.cc:402:21: warning: unused function 'sharedBuffersGet' [-Wunused-function] 402 | static ncclResult_t sharedBuffersGet(struct ncclCollNetSharedRes* collNet, int type, int slot, int channel, int* offset) { | ^~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/coll_net.cc:12: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/gdrwrap.h:204:19: warning: unused variable 'md' [-Wunused-variable] 204 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/coll_net.cc:8: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/coll_net.cc:9: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:33:12: warning: unused function 'collNetSupport' [-Wunused-function] 33 | static int collNetSupport(struct ncclComm* comm) { return comm->ncclCollNet != nullptr ? 1 : 0; } | ^~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/coll_net.cc:12: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/coll_net.cc:199:21: warning: unused function 'collNetDumpMap' [-Wunused-function] 199 | static ncclResult_t collNetDumpMap(struct connectMap* map) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/coll_net.cc:402:21: warning: unused function 'sharedBuffersGet' [-Wunused-function] 402 | static ncclResult_t sharedBuffersGet(struct ncclCollNetSharedRes* collNet, int type, int slot, int channel, int* offset) { | ^~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/coll_net.cc:8: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/coll_net.cc:9: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:33:12: warning: unused function 'collNetSupport' [-Wunused-function] 33 | static int collNetSupport(struct ncclComm* comm) { return comm->ncclCollNet != nullptr ? 1 : 0; } | ^~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/coll_net.cc:12: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/coll_net.cc:199:21: warning: unused function 'collNetDumpMap' [-Wunused-function] 199 | static ncclResult_t collNetDumpMap(struct connectMap* map) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/coll_net.cc:402:21: warning: unused function 'sharedBuffersGet' [-Wunused-function] 402 | static ncclResult_t sharedBuffersGet(struct ncclCollNetSharedRes* collNet, int type, int slot, int channel, int* offset) { | ^~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/coll_net.cc:8: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/coll_net.cc:9: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:33:12: warning: unused function 'collNetSupport' [-Wunused-function] 33 | static int collNetSupport(struct ncclComm* comm) { return comm->ncclCollNet != nullptr ? 1 : 0; } | ^~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/coll_net.cc:12: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/coll_net.cc:199:21: warning: unused function 'collNetDumpMap' [-Wunused-function] 199 | static ncclResult_t collNetDumpMap(struct connectMap* map) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/coll_net.cc:402:21: warning: unused function 'sharedBuffersGet' [-Wunused-function] 402 | static ncclResult_t sharedBuffersGet(struct ncclCollNetSharedRes* collNet, int type, int slot, int channel, int* offset) { | ^~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/coll_net.cc:8: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/coll_net.cc:9: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:33:12: warning: unused function 'collNetSupport' [-Wunused-function] 33 | static int collNetSupport(struct ncclComm* comm) { return comm->ncclCollNet != nullptr ? 1 : 0; } | ^~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/coll_net.cc:12: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/coll_net.cc:199:21: warning: unused function 'collNetDumpMap' [-Wunused-function] 199 | static ncclResult_t collNetDumpMap(struct connectMap* map) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/coll_net.cc:402:21: warning: unused function 'sharedBuffersGet' [-Wunused-function] 402 | static ncclResult_t sharedBuffersGet(struct ncclCollNetSharedRes* collNet, int type, int slot, int channel, int* offset) { | ^~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/coll_net.cc:8: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/coll_net.cc:9: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:33:12: warning: unused function 'collNetSupport' [-Wunused-function] 33 | static int collNetSupport(struct ncclComm* comm) { return comm->ncclCollNet != nullptr ? 1 : 0; } | ^~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/coll_net.cc:12: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/coll_net.cc:199:21: warning: unused function 'collNetDumpMap' [-Wunused-function] 199 | static ncclResult_t collNetDumpMap(struct connectMap* map) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/coll_net.cc:402:21: warning: unused function 'sharedBuffersGet' [-Wunused-function] 402 | static ncclResult_t sharedBuffersGet(struct ncclCollNetSharedRes* collNet, int type, int slot, int channel, int* offset) { | ^~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/coll_net.cc:8: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/coll_net.cc:9: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/coll_net.h:33:12: warning: unused function 'collNetSupport' [-Wunused-function] 33 | static int collNetSupport(struct ncclComm* comm) { return comm->ncclCollNet != nullptr ? 1 : 0; } | ^~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/coll_net.cc:12: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/coll_net.cc:199:21: warning: unused function 'collNetDumpMap' [-Wunused-function] 199 | static ncclResult_t collNetDumpMap(struct connectMap* map) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/coll_net.cc:402:21: warning: unused function 'sharedBuffersGet' [-Wunused-function] 402 | static ncclResult_t sharedBuffersGet(struct ncclCollNetSharedRes* collNet, int type, int slot, int channel, int* offset) { | ^~~~~~~~~~~~~~~~ 23 warnings generated when compiling for gfx1200. 23 warnings generated when compiling for gfx1101. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/net_ib.cc:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/net.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/net_ib.cc:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/net.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/net_ib.cc:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/net.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/net_ib.cc:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/net.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ 23 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/net_tmp.cc:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/gdrwrap.h:204:19: warning: unused variable 'md' [-Wunused-variable] 204 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/net_ib.cc:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/net.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/net_ib.cc:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/net.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ 23 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/net_ib.cc:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/net.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ 23 warnings generated when compiling for gfx1201. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/net_ib.cc:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/net.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | 23 warnings generated when compiling for gfx1102. uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/net_tmp.cc:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/gdrwrap.h:204:19: warning: unused variable 'md' [-Wunused-variable] 204 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/net_tmp.cc:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/gdrwrap.h:204:19: warning: unused variable 'md' [-Wunused-variable] 204 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ 23 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/net_tmp.cc:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/gdrwrap.h:204:19: warning: unused variable 'md' [-Wunused-variable] 204 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/net_tmp.cc:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/gdrwrap.h:204:19: warning: unused variable 'md' [-Wunused-variable] 204 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/net_tmp.cc:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/gdrwrap.h:204:19: warning: unused variable 'md' [-Wunused-variable] 204 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/net_tmp.cc:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/gdrwrap.h:204:19: warning: unused variable 'md' [-Wunused-variable] 204 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/net_tmp.cc:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/gdrwrap.h:204:19: warning: unused variable 'md' [-Wunused-variable] 204 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/net_ib.cc:9: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/net_ib.cc:29: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:75:21: warning: unused function 'xmlAlloc' [-Wunused-function] 75 | static ncclResult_t xmlAlloc(struct ncclXml** xml, int maxNodes) { | ^~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:110:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 110 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:117:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 117 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:124:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 124 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:132:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 132 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:139:21: warning: unused function 'xmlFindTag' [-Wunused-function] 139 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:151:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 151 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:163:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 163 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:179:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 179 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:192:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 192 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:204:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 204 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:217:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 217 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:230:21: warning: unused function 'xmlSetAttrLong' [-Wunused-function] 230 | static ncclResult_t xmlSetAttrLong(struct ncclXmlNode* node, const char* attrName, const int64_t value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:243:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 243 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:255:21: warning: unused function 'xmlGetSub' [-Wunused-function] 255 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:281:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 281 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:288:21: warning: unused function 'xmlAddNode' [-Wunused-function] 288 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:310:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 310 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:323:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 323 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:353:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 353 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:366:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 366 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/net_ib.cc:9: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/net_ib.cc:29: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:75:21: warning: unused function 'xmlAlloc' [-Wunused-function] 75 | static ncclResult_t xmlAlloc(struct ncclXml** xml, int maxNodes) { | ^~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:110:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 110 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:117:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 117 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:124:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 124 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:132:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 132 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:139:21: warning: unused function 'xmlFindTag' [-Wunused-function] 139 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:151:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 151 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:163:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 163 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:179:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 179 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:192:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 192 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:204:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 204 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:217:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 217 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:230:21: warning: unused function 'xmlSetAttrLong' [-Wunused-function] 230 | static ncclResult_t xmlSetAttrLong(struct ncclXmlNode* node, const char* attrName, const int64_t value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:243:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 243 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:255:21: warning: unused function 'xmlGetSub' [-Wunused-function] 255 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:281:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 281 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:288:21: warning: unused function 'xmlAddNode' [-Wunused-function] 288 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:310:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 310 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:323:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 323 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:353:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 353 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:366:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 366 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/net_ib.cc:9: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/net_ib.cc:29: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:75:21: warning: unused function 'xmlAlloc' [-Wunused-function] 75 | static ncclResult_t xmlAlloc(struct ncclXml** xml, int maxNodes) { | ^~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:110:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 110 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:117:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 117 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:124:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 124 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:132:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 132 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:139:21: warning: unused function 'xmlFindTag' [-Wunused-function] 139 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:151:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 151 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:163:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 163 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:179:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 179 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:192:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 192 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:204:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 204 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:217:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 217 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:230:21: warning: unused function 'xmlSetAttrLong' [-Wunused-function] 230 | static ncclResult_t xmlSetAttrLong(struct ncclXmlNode* node, const char* attrName, const int64_t value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:243:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 243 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:255:21: warning: unused function 'xmlGetSub' [-Wunused-function] 255 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:281:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 281 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:288:21: warning: unused function 'xmlAddNode' [-Wunused-function] 288 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:310:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 310 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:323:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 323 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:353:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 353 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:366:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 366 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/net_ib.cc:9: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/net_ib.cc:29: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:75:21: warning: unused function 'xmlAlloc' [-Wunused-function] 75 | static ncclResult_t xmlAlloc(struct ncclXml** xml, int maxNodes) { | ^~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:110:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 110 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:117:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 117 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:124:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 124 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:132:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 132 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:139:21: warning: unused function 'xmlFindTag' [-Wunused-function] 139 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:151:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 151 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:163:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 163 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:179:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 179 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:192:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 192 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:204:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 204 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:217:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 217 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:230:21: warning: unused function 'xmlSetAttrLong' [-Wunused-function] 230 | static ncclResult_t xmlSetAttrLong(struct ncclXmlNode* node, const char* attrName, const int64_t value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:243:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 243 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:255:21: warning: unused function 'xmlGetSub' [-Wunused-function] 255 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:281:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 281 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:288:21: warning: unused function 'xmlAddNode' [-Wunused-function] 288 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:310:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 310 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:323:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 323 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:353:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 353 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:366:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 366 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/net_ib.cc:9: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/net_ib.cc:29: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:75:21: warning: unused function 'xmlAlloc' [-Wunused-function] 75 | static ncclResult_t xmlAlloc(struct ncclXml** xml, int maxNodes) { | ^~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:110:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 110 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:117:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 117 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:124:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 124 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:132:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 132 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:139:21: warning: unused function 'xmlFindTag' [-Wunused-function] 139 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:151:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 151 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:163:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 163 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:179:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 179 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:192:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 192 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:204:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 204 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:217:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 217 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:230:21: warning: unused function 'xmlSetAttrLong' [-Wunused-function] 230 | static ncclResult_t xmlSetAttrLong(struct ncclXmlNode* node, const char* attrName, const int64_t value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:243:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 243 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:255:21: warning: unused function 'xmlGetSub' [-Wunused-function] 255 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:281:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 281 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:288:21: warning: unused function 'xmlAddNode' [-Wunused-function] 288 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:310:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 310 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:323:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 323 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:353:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 353 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:366:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 366 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/net_ib.cc:9: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/net_ib.cc:29: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:75:21: warning: unused function 'xmlAlloc' [-Wunused-function] 75 | static ncclResult_t xmlAlloc(struct ncclXml** xml, int maxNodes) { | ^~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:110:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 110 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:117:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 117 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:124:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 124 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:132:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 132 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:139:21: warning: unused function 'xmlFindTag' [-Wunused-function] 139 | static ncclResult_t xmlFindTag(strIn file included from uct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:151:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 151 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:163:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 163 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:179:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 179 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:192:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 192 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* nod/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/net_ib.cc:9: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/net_ib.cc:29: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:75:21: warning: unused function 'xmlAlloc' [-Wunused-function] 75 | static ncclResult_t xmlAlloc(struct ncclXml** xml, int maxNodes) { | ^~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:110:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 110 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:117:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 117 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:124:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 124 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:132:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 132 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:139:21: warning: unused function 'xmlFindTag' [-Wunused-function] 139 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:151:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 151 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:163:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 163 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:179:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 179 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:192:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 192 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:204:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 204 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:217:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 217 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:230:21: warning: unused function 'xmlSetAttrLong' [-Wunused-function] 230 | static ncclResult_t xmlSetAttrLong(struct ncclXmlNode* node, const char* attrName, const int64_t value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:243:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 243 | static ncce, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:204:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 204 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:217:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 217 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:230:21: warning: unused function 'xmlSetAttrLong' [-Wunused-function] 230 | static ncclResult_t xmlSetAttrLong(struct ncclXmlNode* node, const char* attrName, const int64_t value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:243:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 243 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:255:21: warning: unused function 'xmlGetSub' [-Wunused-function] 255 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:281:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 281 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:288:21: warning: unused function 'xmlAddNode' [-Wunused-function] 288 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:310:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 310 | static ncclResult_t xmlRemoveNolResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:255:21: warning: unused function 'xmlGetSub' [-Wunused-function] 255 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:281:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 281 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:288:21: warning: unused function 'xmlAddNode' [-Wunused-function] 288 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ de(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:323:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 323 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:353:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 353 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:366:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 366 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:310:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 310 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:323:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 323 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:353:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 353 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:366:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 366 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/net_tmp.cc:9: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/net_tmp.cc:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/net_tmp.cc:19: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:211:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 211 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:222:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 222 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:233:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 233 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:245:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 245 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:258:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 258 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:268:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 268 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:279:13: warning: unused function 'isPow2' [-Wunused-function] 279 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:282:12: warning: unused function 'mirrorBits' [-Wunused-function] 282 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/net_tmp.cc:277:21: warning: unused function 'netDumpMap' [-Wunused-function] 277 | static ncclResult_t netDumpMap(struct connectMap* map) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/net_ib.cc:9: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/net_ib.cc:29: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:75:21: warning: unused function 'xmlAlloc' [-Wunused-function] 75 | static ncclResult_t xmlAlloc(struct ncclXml** xml, int maxNodes) { | ^~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:110:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 110 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:117:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 117 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:124:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 124 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:132:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 132 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:139:21: warning: unused function 'xmlFindTag' [-Wunused-function] 139 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:151:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 151 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:163:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 163 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:179:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 179 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:192:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 192 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:204:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 204 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:217:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 217 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:230:21: warning: unused function 'xmlSetAttrLong' [-Wunused-function] 230 | static ncclResult_t xmlSetAttrLong(struct ncclXmlNode* node, const char* attrName, const int64_t value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:243:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 243 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:255:21: warning: unused function 'xmlGetSub' [-Wunused-function] 255 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:281:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 281 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:288:21: warning: unused function 'xmlAddNode' [-Wunused-function] 288 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:310:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 310 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:323:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 323 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:353:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 353 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/xml.h:366:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 366 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/net_tmp.cc:9: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/net_tmp.cc:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/net_tmp.cc:19: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:211:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 211 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:222:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 222 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:233:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 233 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:245:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 245 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:258:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 258 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:268:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 268 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:279:13: warning: unused function 'isPow2' [-Wunused-function] 279 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:282:12: warning: unused function 'mirrorBits' [-Wunused-function] 282 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/net_tmp.cc:277:21: warning: unused function 'netDumpMap' [-Wunused-function] 277 | static ncclResult_t netDumpMap(struct connectMap* map) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/net_tmp.cc:9: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/net_tmp.cc:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/net_tmp.cc:19: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:211:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 211 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:222:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 222 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:233:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 233 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:245:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 245 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:258:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 258 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:268:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 268 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:279:13: warning: unused function 'isPow2' [-Wunused-function] 279 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:282:12: warning: unused function 'mirrorBits' [-Wunused-function] 282 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/net_tmp.cc:277:21: warning: unused function 'netDumpMap' [-Wunused-function] 277 | static ncclResult_t netDumpMap(struct connectMap* map) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/net_tmp.cc:9: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/net_tmp.cc:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/net_tmp.cc:19: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:211:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 211 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:222:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 222 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:233:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 233 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:245:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 245 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:258:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 258 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:268:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 268 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:279:13: warning: unused function 'isPow2' [-Wunused-function] 279 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:282:12: warning: unused function 'mirrorBits' [-Wunused-function] 282 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/net_tmp.cc:277:21: warning: unused function 'netDumpMap' [-Wunused-function] 277 | static ncclResult_t netDumpMap(struct connectMap* map) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/net_tmp.cc:9: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/net_tmp.cc:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/net_tmp.cc:19: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:211:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 211 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:222:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 222 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:233:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 233 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:245:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 245 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:258:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 258 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:268:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 268 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:279:13: warning: unused function 'isPow2' [-Wunused-function] 279 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:282:12: warning: unused function 'mirrorBits' [-Wunused-function] 282 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/net_tmp.cc:277:21: warning: unused function 'netDumpMap' [-Wunused-function] 277 | static ncclResult_t netDumpMap(struct connectMap* map) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/net_tmp.cc:9: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/net_tmp.cc:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/net_tmp.cc:19: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:211:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 211 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:222:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 222 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:233:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 233 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:245:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 245 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:258:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 258 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:268:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 268 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:279:13: warning: unused function 'isPow2' [-Wunused-function] 279 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:282:12: warning: unused function 'mirrorBits' [-Wunused-function] 282 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/net_tmp.cc:277:21: warning: unused function 'netDumpMap' [-Wunused-function] 277 | static ncclResult_t netDumpMap(struct connectMap* map) { | ^~~~~~~~~~ 23 warnings generated when compiling for gfx1101. In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/net_tmp.cc:9: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/net_tmp.ccIn file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/net_tmp.cc:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/net_tmp.cc:19: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:211:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 211 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:222:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 222 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:233:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 233 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:245:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 245 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:258:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 258 | st:9: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/net_tmp.cc:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/net_tmp.cc:19: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:211:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 211 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:222:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 222 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:233:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 233 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:245:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 245 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:258:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 258 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:268:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 268 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:279:13: warning: unused function 'isPow2' [-Wunused-function] 279 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:282:12: warning: unused function 'mirrorBits' [-Wunused-function] 282 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/net_tmp.cc:277:21: warning: unused function 'netDumpMap' [-Wunused-function] 277 | static ncclResult_t netDumpMap(struct connectMap* map) { | ^~~~~~~~~~ atic float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:268:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 268 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:279:13: warning: unused function 'isPow2' [-Wunused-function] 279 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:282:12: warning: unused function 'mirrorBits' [-Wunused-function] 282 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/net_tmp.cc:277:21: warning: unused function 'netDumpMap' [-Wunused-function] 277 | static ncclResult_t netDumpMap(struct connectMap* map) { | ^~~~~~~~~~ 23 warnings generated when compiling for gfx90a. 23 warnings generated when compiling for gfx90a. 23 warnings generated when compiling for gfx1100. 23 warnings generated when compiling for gfx1200. 23 warnings generated when compiling for gfx1201. 16 warnings generated when compiling for gfx1101. 23 warnings generated when compiling for gfx1102. 16 warnings generated when compiling for gfx1200. 16 warnings generated when compiling for gfx1102. 16 warnings generated when compiling for gfx1100. 16 warnings generated when compiling for gfx1201. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/net_socket.cc:7: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/net_socket.cc:7: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ 16 warnings generated when compiling for gfx90a. 16 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/net_socket.cc:7: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/net_socket.cc:7: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ 23 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/net_socket.cc:7: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/net_socket.cc:7: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ [ 56%] Building CXX object CMakeFiles/rccl.dir/hipify/src/transport/nvls.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60300 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/src/transport/nvls.cc.o -MF CMakeFiles/rccl.dir/hipify/src/transport/nvls.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/transport/nvls.cc.o -c /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/nvls.cc In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/net_socket.cc:7: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/net_socket.cc:7: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/net_socket.cc:7: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/net_socket.cc:7: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/net_socket.cc:7: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/net_socket.cc:7: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/net_socket.cc:7: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/net_socket.cc:7: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/net_socket.cc:7: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/net_socket.cc:7: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 2 warnings generated when compiling for gfx1200. 2 warnings generated when compiling for gfx1201. 2 warnings generated when compiling for gfx1102. 2 warnings generated when compiling for gfx90a. 2 warnings generated when compiling for gfx1101. 2 warnings generated when compiling for host. 2 warnings generated when compiling for gfx90a. 2 warnings generated when compiling for gfx1100. 23 warnings generated when compiling for host. [ 56%] Building CXX object CMakeFiles/rccl.dir/hipify/src/transport/p2p.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60300 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/src/transport/p2p.cc.o -MF CMakeFiles/rccl.dir/hipify/src/transport/p2p.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/transport/p2p.cc.o -c /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/p2p.cc [ 56%] Building CXX object CMakeFiles/rccl.dir/hipify/src/transport/shm.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60300 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/src/transport/shm.cc.o -MF CMakeFiles/rccl.dir/hipify/src/transport/shm.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/transport/shm.cc.o -c /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/shm.cc 16 warnings generated when compiling for host. [ 56%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/all_gather_sum_i8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60300 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/all_gather_sum_i8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/all_gather_sum_i8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/all_gather_sum_i8.cpp.o -c /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/nvls.cc:9: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/nvls.cc:9: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/nvls.cc:9: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/nvls.cc:9: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/nvls.cc:9: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/nvls.cc:9: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/nvls.cc:9: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/nvls.cc:9: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/nvls.cc:9: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/nvls.cc:9: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/nvls.cc:9: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/nvls.cc:9: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/nvls.cc:9: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 2 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/nvls.cc:9: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/nvls.cc:9: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/nvls.cc:9: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 2 warnings generated when compiling for gfx1100. 2 warnings generated when compiling for gfx90a. 2 warnings generated when compiling for gfx1102. 2 warnings generated when compiling for gfx1101. 2 warnings generated when compiling for gfx1201. 2 warnings generated when compiling for gfx90a. 2 warnings generated when compiling for gfx1200. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/shm.cc:7: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/p2p.cc:8: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ [ 57%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_minmax_bf16.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60300 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_minmax_bf16.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_minmax_bf16.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_minmax_bf16.cpp.o -c /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/shm.cc:7: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/p2p.cc:8: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/p2p.cc:8: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/p2p.cc:8: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/shm.cc:7: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/shm.cc:7: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/p2p.cc:8: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/shm.cc:7: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/p2p.cc:8: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/shm.cc:7: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/shm.cc:7: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/shm.cc:7: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/p2p.cc:8: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/p2p.cc:8: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/shm.cc:7: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/shm.cc:7: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/shm.cc:7: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/shm.cc:7: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.hIn file included from :11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/p2p.cc:8: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/p2p.cc:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:211:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 211 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:222:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 222 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:233:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 233 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:245:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 245 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:258:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 258 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:268:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 268 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:279:13: warning: unused function 'isPow2' [-Wunused-function] 279 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:282:12: warning: unused function 'mirrorBits' [-Wunused-function] 282 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/shm.cc:7: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/p2p.cc:8: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/p2p.cc:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:211:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 211 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:222:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 222 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:233:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 233 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:245:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 245 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:258:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 258 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:268:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 268 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:279:13: warning: unused function 'isPow2' [-Wunused-function] 279 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:282:12: warning: unused function 'mirrorBits' [-Wunused-function] 282 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/p2p.cc:8: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/p2p.cc:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:211:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 211 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:222:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 222 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:233:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 233 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:245:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 245 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:258:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 258 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:268:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 268 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:279:13: warning: unused function 'isPow2' [-Wunused-function] 279 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:282:12: warning: unused function 'mirrorBits' [-Wunused-function] 282 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/shm.cc:7: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/shm.cc:7: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/shm.cc:7: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/p2p.cc:8: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/p2p.cc:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:211:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 211 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:222:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 222 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:233:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 233 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:245:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 245 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:258:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 258 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:268:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 268 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:279:13: warning: unused function 'isPow2' [-Wunused-function] 279 | static bIn file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/p2p.cc:8: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/p2p.cc:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:211:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 211 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:222:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 222 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:233:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 233 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:245:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 245 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:258:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 258 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:268:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 268 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:279:13: warning: unused function 'isPow2' [-Wunused-function] 279 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:282:12: warning: unused function 'mirrorBits' [-Wunused-function] 282 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ ool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:282:12: warning: unused function 'mirrorBits' [-Wunused-function] 282 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/p2p.cc:8: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/p2p.cc:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:211:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 211 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:222:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 222 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:233:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 233 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:245:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 245 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:258:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 258 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:268:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 268 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:279:13: warning: unused function 'isPow2' [-Wunused-function] 279 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:282:12: warning: unused function 'mirrorBits' [-Wunused-function] 282 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/p2p.cc:8: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/p2p.cc:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:211:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 211 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:222:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 222 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:233:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 233 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:245:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 245 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:258:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 258 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:268:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 268 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:279:13: warning: unused function 'isPow2' [-Wunused-function] 279 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:282:12: warning: unused function 'mirrorBits' [-Wunused-function] 282 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/p2p.cc:8: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/transport/p2p.cc:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:211:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 211 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:222:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 222 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:233:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 233 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:245:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 245 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:258:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 258/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:268:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 268 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:279:13: warning: unused function 'isPow2' [-Wunused-function] 279 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/graph/topo.h:282:12: warning: unused function 'mirrorBits' [-Wunused-function] 282 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ 102 warnings generated when compiling for gfx1100. warnings generated when compiling for gfx1101. 2 warnings generated when compiling for gfx1200. 2 warnings generated when compiling for gfx1101. 2210 warnings generated when compiling for gfx1102. warnings generated when compiling for gfx90a. 2 warnings generated when compiling for gfx90a. warnings generated when compiling for gfx1201. 10 warnings generated when compiling for gfx90a. 10 warnings generated when compiling for gfx1201. 2 warnings generated when compiling for gfx1102. 10 warnings generated when compiling for gfx1200. 10 warnings generated when compiling for gfx1100. 10 warnings generated when compiling for gfx90a. 2 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_gather.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ [ 57%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_minmax_bf8.cpp.o In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_gather.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60300 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_minmax_bf8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_minmax_bf8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_minmax_bf8.cpp.o -c /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_gather.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_gather.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_gather.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_gather.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_gather.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_gather.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_gather.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_gather.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_gather.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_gather.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_gather.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_gather.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_gather.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 10 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_gather.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ [ 57%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_minmax_f16.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60300 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_minmax_f16.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_minmax_f16.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_minmax_f16.cpp.o -c /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_gather.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_gather.h:60:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 60 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_gather.h:159:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 159 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllGather_RING_SIMPLE_Sum_i8, ncclFuncAllGather, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_gather.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_gather.h:60:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 60 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_gather.h:159:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 159 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllGather_RING_SIMPLE_Sum_i8, ncclFuncAllGather, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_gather.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_gather.h:60:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 60 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_gather.h:159:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 159 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllGather_RING_SIMPLE_Sum_i8, ncclFuncAllGather, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_gather.h:60:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 60 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_gather.h:173:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 173 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllGather_RING_LL128_Sum_i8, ncclFuncAllGather, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_gather.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_gather.h:60:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 60 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_gather.h:159:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 159 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllGather_RING_SIMPLE_Sum_i8, ncclFuncAllGather, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_gather.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_gather.h:60:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 60 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_gather.h:159:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 159 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllGather_RING_SIMPLE_Sum_i8, ncclFuncAllGather, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_gather.h:60:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 60 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_gather.h:173:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 173 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllGather_RING_LL128_Sum_i8, ncclFuncAllGather, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_gather.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_gather.h:60:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 60 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_gather.h:159:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 159 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllGather_RING_SIMPLE_Sum_i8, ncclFuncAllGather, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group),/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_gather.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_gather.h:60:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 60 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_gather.h:159:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 159 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllGather_RING_SIMPLE_Sum_i8, ncclFuncAllGather, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_gather.h:60:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 60 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_gather.h:159:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 159 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllGather_RING_SIMPLE_Sum_i8, ncclFuncAllGather, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_gather.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_gather.h:60:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 60 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_gather.h:159:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 159 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllGather_RING_SIMPLE_Sum_i8, ncclFuncAllGather, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_gather.h:60:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 60 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_gather.h:159:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 159 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllGather_RING_SIMPLE_Sum_i8, ncclFuncAllGather, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_gather.h:60:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 60 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_gather.h:159:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 159 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllGather_RING_SIMPLE_Sum_i8, ncclFuncAllGather, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_gather.h:60:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 60 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_gather.h:159:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 159 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllGather_RING_SIMPLE_Sum_i8, ncclFuncAllGather, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_gather.h:60:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 60 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_gather.h:159:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 159 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllGather_RING_SIMPLE_Sum_i8, ncclFuncAllGather, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_gather.h:60:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 60 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_gather.h:159:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 159 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllGather_RING_SIMPLE_Sum_i8, ncclFuncAllGather, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_gather.h:60:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 60 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_gather.h:159:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 159 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllGather_RING_SIMPLE_Sum_i8, ncclFuncAllGather, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 9 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_gather.h:60:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 60 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_gather.h:159:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 159 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllGather_RING_SIMPLE_Sum_i8, ncclFuncAllGather, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ 371 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf16, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreadsIn file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ ), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf16, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf16, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_MinMax_bf16, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf16, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_MinMax_bf16, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_MinMax_bf16, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_MinMax_bf16, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf16, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_MinMax_bf16, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf16, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_MinMax_bf16, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf16, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(thread/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ Idx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf16, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf16, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf16, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf16, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf16, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf16, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf16, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf16, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf16, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf16, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf16, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid),:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf16, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf16, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf16, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf16, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_bf16, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS//builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: swarning: initializer order does not match the declaration order [-Wreorder-ctor]izeof(T) : 667 | s tepSize_) {t | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ id(ti| group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here a 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf16, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ds(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf16, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf16, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf16, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_bf16, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | ruIn file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; nTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf16, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf16, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf16, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_bf16, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1057:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 1057 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_MinMax_bf16, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf16, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; In file included from | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_bf16, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf16, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_bf16, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf16, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_bf16, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(gro warnings generated when compiling for host. up), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_bf16, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_bf16, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint3t* ptr = recvPtr(0)+ll128Offset; | ^~~ 2_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_bf16, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h->count; | ^~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ :370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_bf16, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count;In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(t/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ id), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1057:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 1057 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_MinMax_bf16, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from :370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ :667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_bf16, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_bf16, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = arg/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buff/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ :222:19: warning: unused variable 'size' [-Wunused-variable] 222 | Sizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitivesric<1>, c0, Pouroto, 0> prims | ^ nt; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_bf16, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.hIn file included from :667/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp::15:2 : note: In file included from field 'nthreads' will be initialized after field 'tidInBlock'/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h :10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: 667warning: | unused variable 'data1' [-Wunused-variable] tid(tid), nth r140e | a ds ( n tuhirneta3d2s_)t, dtaitdaI1n,B lfolcakg(1t,h rdeaatdaI2d,x .flxag),2 ;g r o| u ^~~~~p (g/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.hr:140o:u21p:), warning: unused variable 'flag1' [-Wunused-variable] | ^~~~~~~~~~~~~~~~~ 140/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h | : 667 : 60 :u inote: nfield 'group' will be initialized after field 'stepSize't 32_t data1, 667f | l a g 1 ,t idda(ttai2d,) ,f lnatgh2r;e a d| s ^~~~~( nthr/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.he:a140d:s28):, warning: tunused variable 'data2' [-Wunused-variable]i dInBlock (140t | h r e a duIidnxt.3x2)_,t gdraotuap1(,g rfoluapg)1,, d| a ^~~~~~~~~~~t a2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ s->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid)/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp, nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_bf16, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthr:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: eads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.hIn file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ :370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.hIn file included from :222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem./builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_bf16, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_bf16, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf8, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf8, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_MinMax_bf8, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_MinMax_bf8, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_MinMax_bf8, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf8, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf8, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf8, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f16, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf8, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf8, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf8, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit<__half, FuncMinMax<__half>, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_MinMax_f16, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f16, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf8, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit<__half, FuncMinMax<__half>, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_MinMax_f16, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf8, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f16, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf8, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f16, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShme_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit<__half, FuncMinMax<__half>, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_MinMax_f16, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ m.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f16, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f16, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | ti/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] :667 :15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf8, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406d | ( t i dR)u,n Wnotrhkrt,i d%aWlAgRoP,_ SpIrZoE)t,o ,w a2r>p(()t.irdu/nW(A&RnPc_cSlISZhEm)e,m . w| o ~~~~~~~~~~~~~~~~~~r k )| ; stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h505 | : 667 : 15 :w arnote: field 'nthreads' will be initialized after field 'tidInBlock'p InBlock(thread I667d | x. x / W AtRiPd_(StIiZdE)),, n th| r ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e a d| s warp(tid/WARP_SIZE (nthreads) ,506 | tid I n Bflolcakg(TthhrreeadadI(dx(.tixd),% 4gr)o==u3p)(,g rgoruop)u, p(g r| ^~~~~~~~~~~~~~~~~ oup/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h):,667 : 60| : ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ note: field 'group' will be initialized after field 'stepSize'| warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 667 | 507 | t i d ( tsitde)p,S inzteh(rnecacdlsS(hnmtehmr.ecaodmsm).,b utfifdSIinzBelso[cNkC(CtLh_rPeRaOTdOI_dLxL.1x2)8,] /gNrCoCuLp_(STgErPoSu/ps)i,z e o| f ^~~~~~~~~~~( uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit<__half, FuncMinMax<__half>, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_MinMax_f16, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf8, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ (&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_MinMax_bf8, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit<__half, FuncMinMax<__half>, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_MinMax_f16, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f16, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_MinMax_bf8, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), n/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f16, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ threads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203667 | | t i dR(utniWdo)r,k Enltehmreenatdr(o)u.pr(ugnr(owuep));, | | ^ ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:7: 1668: | note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here stepSize(stepSi z7e | _D E=F=I N0E _?n cncclcDleSvhFmuenmc.(cAolmlmR.ebduufcfeS_iTzReEsE[_NSCICMLP_LPER_OMTiOn_MSaIxM_PbLfE8],/ NnCcCcLl_FSuTnEcPASl/lsRiezdeuocfe(,T )F u:n csMtienpMSaixz,e _r)c c{l _ b| f ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l o a| t group(group8 , NCCL_ALGO_TREE, NCCL_PROTO_SI/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.hM:P301L:E90): note: | ^in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h: 409301: | 52 : note: expanded from macro 'DEFINE_ncclDevFunc' Primitive s409< | T , R eRduOnpW,o rFka_,M AaXl_gDoE,V _pArRoItToY,> ,4 >/(*)D.irruenc(t&=n*c/c0l,S hPmreomt.ow,o r0k>) ;p r\i m s| ^ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h::15565:: 5note: :field 'nthreads' will be initialized after field 'tidInBlock' note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 667565 | | triudn(Ttriede)U,p Dnotwhnr ,g rCoOuLpL(_gUrNoRuOpL)L,> ( a| r ^~~~~~~~~~~~~~~~~g s); | /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h ^: 667:60: note: field 'group' will be initialized after field 'stepSize' 667/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h | : 203 : 66 :t inote: din instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here( tid), nthr e203a | d s ( n t h r e aRdusn)W,o rtkiEdlIenmBelnotc().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f16, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f16, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit<__half, FuncMinMax<__half>, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_MinMax_f16, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf8, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf8, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_MinMax_bf8, ncclFu/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid),ncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf8, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f16, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_bf8, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f16, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf8, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f16, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf8, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f16, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f16, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f16, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f16, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h tid(tid), nthreads(nthr:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_bf8, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ eads), tidIn/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.hBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? In file included from ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf8, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf8, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f16, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_bf8, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf8, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf8, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f16, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f16, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(w/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f16, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ e); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f16, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf8, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlo:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElementI(d)x..rxu)n,( wger)o;u p (| g ^r oup), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf8, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f16, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f16, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf8, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf8, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(t:667id), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf8, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f16, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f16, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_bf8, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f16, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f16, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1057:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 1057 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_MinMax_bf8, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f16, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f16, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads):667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] , t idInBlock(thr667 | ead I dx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf8, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf8, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf8, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f16, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f16, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : step/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threSaize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f16, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ dIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_bf8, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f16, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_bf8, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_bf8, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f16, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f16, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f16, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_SIn file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ TEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_bf8, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ :667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f16, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:12:/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf8, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | 1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_bf8, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f16, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_bf8, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1057:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoLL128, 2>' requested here 1057 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_MinMax_f16, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_bf8, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1057:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoLL128, 2>' requested here 1057 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_MinMax_f16, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_bf8, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f16, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f16, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f16, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_bf8, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1057:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 1057 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_MinMax_bf8, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_bf8, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f16, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_bf8, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f16, ncclFuncAl/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(groEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f16, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, protup), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ o, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f16, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h 12 | :D667E:F15I:N Ewarning: _initializer order does not match the declaration order [-Wreorder-ctor]n cclDevFunc(AllReduce_RING_SI M667P | L E _ M itniMda(xt_ibdf)8,, nntchcrleFaudnsc(AnltlhRreedaudcse),, FtuindcIMniBnlMoacxk,( trhcrcela_dbIfdlxo.axt)8,, gNrCoCuLp_(AgLrGoOu_pR)I,N G ,| ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~N C C| L tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize__ PROTO_SIMPLE) | 668^ | stepSize(ste/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.hp:S409i:z52e:_ note: =expanded from macro 'DEFINE_ncclDevFunc'= 0 ? ncclShm e409m | . c o m mR.ubnuWfofrSki, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f16, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ redop, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f16, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 99 warnings generated when compiling for gfx1200. warnings generated when compiling for gfx1102. 9 warnings generated when compiling for gfx1101. 9 warnings generated when compiling for gfx1201. 9 warnings generated when compiling for gfx1100. 10 warnings generated when compiling for gfx90a. 10 warnings generated when compiling for gfx90a. [ 57%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_minmax_f32.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60300 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_minmax_f32.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_minmax_f32.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_minmax_f32.cpp.o -c /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | In file included from uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cppIn file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ :2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.hIn file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ :222:19: warning: unused variable 'size' [-Wunused-variable] 222 | conIn file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ st ssize_t size = args->count; /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.hIn file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->c/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ ount; | ^~~~ :370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f32, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f32, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f32, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_MinMax_f32, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f32, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_MinMax_f32, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.hIn file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f32, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f32, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_MinMax_f32, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_MinMax_f32, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_MinMax_f32, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f32, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f32, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_MinMax_f32, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f32, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f32, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f32, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f32, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f32, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f32, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f32, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f32, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f32, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ redop, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f32, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f32, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f32, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f32, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tid/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.hInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f32, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f32, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f32, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f32, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f32, ncclFuncAllReduce, FuncMinMax, float, N/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), CCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f32, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f32, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f32, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f32, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f32, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f32, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f32, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f32, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f32, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f32, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f32, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f32, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f32, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ warnings generated when compiling for host. /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f32, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp : R2u: nIn file included from Wo/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.hr:k10E: lIn file included from e/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.hm:e169n: t() .504r | u n ( w et)i;d ( t| i ^d ), nthreads(n/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cppt:h12r:e1a:d snote: )in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here, wid(tid%WARP_SIZ E12) | ,D EwFaIrNpE(_tnicdc/lWDAeRvPF_uSnIcZ(EA)l,l R e| d ~~~~~~~~~~~~~~~~~~ u c| e stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)_ RING_SIMP LE505_M | inM a x _ fw32a,r pnIccnlBFluoncckA(ltlhRreedaudIcdex,. xF/unWcMAiRnPM_aSx,I ZflEo)a,t , | N ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~CC L | _ warp(tid/WARP_SIZE ALGO_RING, 506N | CC L_ P R OfTlOa_gSTIhMrPeLaEd)( ( | t^i d%4)==/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h3:)409,: 52g:r onote: expanded from macro 'DEFINE_ncclDevFunc'u p(gro u409p | ), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~R u n| Wo warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3rk s, talegpoS, ipzreo(tno,cc l4S>h(m)em.comm.buffSizes[NCCL_PROTO_LL1.28ru]/NnC(&CncLcl_SSThmEPeSm/.swiozerokf()u;i n\t64 _ t| ^) ) { | /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~: 667: 15| : group(group note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h):,62 :n56t:h rnote: ein instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested herea ds(nthreads) ,62 | t Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1057:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 1057 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_MinMax_f32, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ idInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1057:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 1057 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_MinMax_f32, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f32, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f32, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f32, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f32, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f32, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f32, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f32, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 17 warnings generated when compiling for gfx1102. 17 warnings generated when compiling for gfx1201. 17 warnings generated when compiling for gfx1100. 17 warnings generated when compiling for gfx1200. 17 warnings generated when compiling for gfx1101. 17 warnings generated when compiling for gfx1100. 17 warnings generated when compiling for gfx1101. 17 warnings generated when compiling for gfx1102. 17 warnings generated when compiling for gfx1201. 17 warnings generated when compiling for gfx1200. 21 warnings generated when compiling for gfx90a. 21 warnings generated when compiling for gfx90a. [ 58%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_minmax_f64.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60300 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_minmax_f64.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_minmax_f64.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_minmax_f64.cpp.o -c /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp 21 warnings generated when compiling for gfx90a. 21 warnings generated when compiling for gfx90a. [ 58%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_minmax_f8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60300 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_minmax_f8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_minmax_f8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_minmax_f8.cpp.o -c /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ 15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->countIn file included from ; | /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = arIn file included from gs->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ 17In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f64, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_MinMax_f64, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ 17 warnings generated when compiling for gfx1201. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f64, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_MinMax_f64, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f64, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_MinMax_f64, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ 17 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f64, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f64, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx1101. /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f64, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f64, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.co/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f64, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ mm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f64, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx1200. /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f64, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f64, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f64, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f64, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f64, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_MinMax_f64, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f64, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ TreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_MinMax_f64, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f64, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_MinMax_f64, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f64, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f64, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f64, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f64, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f64, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f64, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f64, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f64, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f64, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f64, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1057:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 1057 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_MinMax_f64, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f64, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f64, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here _PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f64, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f64, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f64, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f64, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f64, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f64, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f64, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f64, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f64, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f64, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f64, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.hIn file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ :667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f64, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f64, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f64, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.hIn file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ :667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f64, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 17 warnings generated when compiling for host. /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f64, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1057:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 1057 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_MinMax_f64, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f64, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f64, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f64, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f64, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp33:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ | const ssize_t size = a/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ rgs->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f8, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_MinMax_f8, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_MinMax_f8, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_MinMax_f8, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f8, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f8, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f8, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f8, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f8, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f8, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f8, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f8, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f8, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_MinMax_f8, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_MinMax_f8, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UIn file included from NROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f8, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_MinMax_f8, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f8, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f8, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f8, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f8, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f8, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | :s667t:15e: pwarning: initializer order does not match the declaration order [-Wreorder-ctor]S ize(s667t | e p tSidi(tzide), _nt hr=ead=s( n0thr ea?ds ),n tcidcInlBlSochk(tmhreeadmId.x.cx)o, mgrmou.p(bgrouufp),f S| ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~iz e| tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_s [N668C | C L s_tePpSRizOe(TO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f8, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ic<1, NCCL_MAX_DEV_ARITY>, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f8, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f8, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f8, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f8, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f8, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f8, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f8, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1057:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 1057 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_MinMax_f8, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f8, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f8, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f8, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f8, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f8, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f8, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f8, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f8, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f8, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f8, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f8, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested hereIn file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1057:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 1057 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_MinMax_f8, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f8, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from 17 warnings generated when compiling for host. /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f8, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f8, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f8, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f8, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f8, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f8, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f8, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f8, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f8, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f8, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f8, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f8, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 21 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 21 warnings generated when compiling for gfx90a. [ 58%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_minmax_u32.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60300 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_minmax_u32.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_minmax_u32.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_minmax_u32.cpp.o -c /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.hIn file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ :222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = aIn file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ rgs->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ 270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ :222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.hIn file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ :370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u32, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_MinMax_u32, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_MinMax_u32, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_MinMax_u32, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u32, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u32, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u32, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u32, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u32, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u32, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_MinMax_u32, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_MinMax_u32, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u32, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_MinMax_u32, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u32, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u32, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u32, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u32, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u32, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u32, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u32, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u32, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u32, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u32, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u32, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u32, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u32, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u32, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u32, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinM/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.hax:_667u:3152:, warning: ninitializer order does not match the declaration order [-Wreorder-ctor]c clFuncAllReduce, FuncMinMax ,667 | u i n t 3t2i_dt(,t iNdC)C,L _nAtLhGrOe_aTdRsE(En,t hNrCeCaLd_sP)R,O TtOi_dSIInMBPlLoEc)k ( t| h^r eadIdx.x), grou/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.hp(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u32, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u32, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u32, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.co/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group mm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepS/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: inote: zin instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested heree _) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 301 | Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here> , /*Direct=*/0, P r301o | t o , 0 > Pprriimmist i v| e ^s , ProtoSimple<1, 1, 4>, 4>' requested here, NCCL_MAX_DE V565_ | A RI T Y> ,r u/n*TDriereeUcptD=o*w/n0 tporSiimmpsl e <1| , ^ 1, COLL_UNROLL/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h>:,565 :CO5LL:_ UNnote: Rin instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested hereO LL>(args); | 565 ^ | runTre/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.heU:p203D:o66:w n, 0, 2, 4>::run' requested hereT , RedO p203 | , P r o t oRSuinmWporlke, ,A lCgOoL, LP_rUoNtRoO, LCLO>L(aLr_UgNsRO)L; L> (| ^) .run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here203 | 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u32, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u32, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u32, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u32, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u32, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u32, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u32, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u32, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u32, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u32, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u32, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1057:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 1057 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_MinMax_u32, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u32, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u32, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u32, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u32, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u32, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1057:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 1057 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_MinMax_u32, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u32, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u32, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.hIn file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ :667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u32, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u32, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u32, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 21 warnings generated when compiling for gfx90a. 21 warnings generated when compiling for gfx90a. 17 warnings generated when compiling for gfx1201. 17 warnings generated when compiling for gfx1100. 17 warnings generated when compiling for gfx1102. 17 warnings generated when compiling for gfx1101. 17 warnings generated when compiling for gfx1200. 21 warnings generated when compiling for gfx90a. 21 warnings generated when compiling for gfx90a. [ 58%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_minmax_u64.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60300 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_minmax_u64.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_minmax_u64.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_minmax_u64.cpp.o -c /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp 17 warnings generated when compiling for gfx1102. 17 warnings generated when compiling for gfx1100. 17 warnings generated when compiling for gfx1201. 17 warnings generated when compiling for gfx1101. 17 warnings generated when compiling for gfx1200. [ 59%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_minmax_u8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60300 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_minmax_u8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_minmax_u8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_minmax_u8.cpp.o -c /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = argsIn file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ In file included from ->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] In file included from 371 | const int bid = gridOffset / channelCount; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19:In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u64, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_MinMax_u64, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_MinMax_u64, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u64, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_MinMax_u64, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_MinMax_u64, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_MinMax_u64, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u64, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_MinMax_u64, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ :2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u64, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSizIn file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ e_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u64, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u64, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u64, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u64, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx1201. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u64, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u64, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u64, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u64, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthread/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u64, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ s(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u64, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ :222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ 667 | tid(tid/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp):,7 :n1t:h rnote: ein instantiation of member function 'RunWork, 0, 2, 2>::run' requested herea ds(nthreads), tidIn B7l | oDcEkF(tIhNrEe_adnIcdxc.lxD)e,v Fgurnocu(pA(lglroRuepd),u c e| _ ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~T R | E tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_E _SIMPLE_MinMax_u 66684 | , n c csltFeupnSciAzlel(RsetdeupcSei,z eF_u n=c=M i0n ?M anxc,c luSihnmte6m4._cto,m mN.CbCuLf_fASLiGzOes_[TNRCECEL,_ PNRCOCTLO__SPIRMPOLTEO]/_NSCICMLP_LSTEE)P S /| s^i zeof(T) : stepS/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.hiz:e406_:)52 :{ note: expanded from macro 'DEFINE_ncclDevFunc' | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 406 | RunWork, a/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.hlg:o,301 :p90r:o tnote: o,in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 2>().run(&ncclS h301m | e m . w o r kP)r;i m\i t| i ^v esd, s/(*ntDhirreeacdst)=,* t/id0I,nB loPckr(otthor,e a0d>Id x.pxr),im sg ro u| p ^( group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h::667:560: : note: note: field 'group' will be initialized after field 'stepSize'in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 667 | 565 | runTreeUpDown, C OtLiLd_(UtNiRdO)L,L> n(tahrrgesa)d;s ( n| t ^h reads), tidInBlock(th/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.hr:e203a:d66:I dnote: xin instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here. x), group( 203 | RunWorkElement ( )| . ^~~~~~~~~~~r un(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u64, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u64, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u64, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx1101. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u64, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u64, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u64, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u64, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u64, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u64, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u64, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u64, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u64, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx1102. /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u64, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] == 0 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here ? ncc l203S | h m e m . c o m mR.ubnufWfoSrikzEelse[mNeCnCtL<_FPnR,O TOT_,S IRMePdLOEp],/ NACClLg_oS,T EPPSr/ostioz,e oCfO(LTL)_ U:N RsOteLpLS>i(z)e._r)u n{( we )| ; ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | | ^ group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h_n:c301c:l90D:e vnote: Fin instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested hereu nc(AllReduce_TREE _301S | I M P L E _ MPriinmMiatxi_vue6s4<,T ,n cRceldFOupn,c AFlalnRAesdyumcmeet,r iFcuC,C L/_*ADLiGrOe_cTtR=E*E/,0 ,N CPCrLo_tPoR,OT O0_>S IpMrPiLmEs) | | ^^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52:/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h :note: 565expanded from macro 'DEFINE_ncclDevFunc': 5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 409 | R565u | n W o r krP,r oatlogSoi,m pplreo C(O)L.Lr_uUnN(RO&LnLc>c,l COLSLh_mUeNmR.OwLoLr>k()a;r g\s ) ;| ^ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 667 | t203i | d ( t i d ) , nRtuhnrWeoardksE(lnetmhernetar(o)u.pr)u,n ( w| e ^~~~~~~~~~~~~~~~~) ; | ^/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h :667:60: note: field 'group' will be initialized after field 'stepSize' /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:7 :6671 | : note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here tid(tid), nthreads (7n | tDhErFeIaNdEs_)n,c ctliDdeIvnFBulnocc(kA(ltlhRreedaudcIed_xT.x)R,E Eg_rSoIuMpP(LgEr_oMuipn)M,a x _| u ^~~~~~~~~~~6 4, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u64, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u64, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u64, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u64, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx1100. /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u64, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx1200. /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u64, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u64, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u64, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u64, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u64, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1057:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 1057 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_MinMax_u64, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1057:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 1057 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_MinMax_u64, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u64, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u64, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u64, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u64, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u64, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u8, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u64, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u64, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h17 warnings generated when compiling for host. :14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u64, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u8, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u8, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u8, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u8, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_MinMax_u8, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u8, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_MinMax_u8, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u8, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_MinMax_u8, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_MinMax_u8, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ 565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u8, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_MinMax_u8, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u8, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads( RunWorkElement().run(we); nthre | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_MinMax_u8, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ ads), tidInBlock(threadIdx.xIn file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u8, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u8, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u8, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u8, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u8, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInB/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u8, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, alock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u8, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ lgo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u8, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u8, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u8, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~{ | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u8, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u8, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u8, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u8, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u8, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u8, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u8, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u8, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: (we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u8, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u8, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u8, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u8, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u8, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u8, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u8, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u8, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u8, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u8, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u8, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u8, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u8, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1057:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 1057 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_MinMax_u8, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u8, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1057:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 1057 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_MinMax_u8, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ L>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u8, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u8, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u8, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u8, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u8, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for host. /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u8, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u8, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 21 warnings generated when compiling for gfx90a. 21 warnings generated when compiling for gfx90a. [ 59%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_premulsum_bf16.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60300 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_premulsum_bf16.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_premulsum_bf16.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_premulsum_bf16.cpp.o -c /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t sizIn file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ e = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf16, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf16, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf16, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizeIn file included from s/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_PreMulSum_bf16, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ [NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf16, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_PreMulSum_bf16, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_PreMulSum_bf16, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf16, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf16, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLIn file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf16, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); E]/NCCL_STEPS/sizeof\ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ (T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf16, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf16, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf16, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf16, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_PreMulSum_bf16, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_PreMulSum_bf16, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf16, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf16, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_PreMulSum_bf16, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf16, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf16, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf16, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidIn/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf16, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ Block(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf16, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_bf16, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf16, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_bf16, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf16, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf16, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf16, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf16, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), grou/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf16, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ p(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_bf16, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_bf16, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclD/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] evFunc(AllReduce_RING_SIMPLE_PreMulSum_bf16, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x 667 | tid(tid), nt), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf16, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ hreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:In file included from 301/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ :90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf16, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf16, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf16, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf16, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf16, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_bf16, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf16, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1057:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 1057 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_PreMulSum_bf16, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf16, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_bf16, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_bf16, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_bf16, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_bf16, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_bf16, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1057:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 1057 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_PreMulSum_bf16, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_bf16, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for host. /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_bf16, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_bf16, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_bf16, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_bf16, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 21 warnings generated when compiling for gfx90a. 21 warnings generated when compiling for gfx90a. 17 warnings generated when compiling for gfx1102. 17 warnings generated when compiling for gfx1201. 17 warnings generated when compiling for gfx1200. 17 warnings generated when compiling for gfx1100. 17 warnings generated when compiling for gfx1101. 21 warnings generated when compiling for gfx90a. 21 warnings generated when compiling for gfx90a. [ 59%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_premulsum_bf8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60300 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_premulsum_bf8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_premulsum_bf8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_premulsum_bf8.cpp.o -c /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp 17 warnings generated when compiling for gfx1200. 17 warnings generated when compiling for gfx1100. 17 warnings generated when compiling for gfx1102. 17 warnings generated when compiling for gfx1101. 17 warnings generated when compiling for gfx1201. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ 21 warnings generated when compiling for gfx90a. /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ :370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size->count; | ^~~~ = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ 17 warnings generated when compiling for gfx1101. 17 warnings generated when compiling for gfx1200. 17 warnings generated when compiling for gfx1100. 21 warnings generated when compiling for gfx90a. 17 warnings generated when compiling for gfx1201. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf8, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ [ 59%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_premulsum_f16.cpp.o In file included from /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60300 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_premulsum_f16.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_premulsum_f16.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_premulsum_f16.cpp.o -c /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf8, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf8, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_PreMulSum_bf8, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf8, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, CIn file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_PreMulSum_bf8, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ OLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf8, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf8, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf8, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_PreMulSum_bf8, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ : /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here In file included from 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf8, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_PreMulSum_bf8, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_PreMulSum_bf8, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf8, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf8, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_PreMulSum_bf8, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf8, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | t htrieda(dtIiddx).,x )n,t hgrreoaudps((gnrtohurpe)a,d s) ,| ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~t i d| I tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_n Block(threadIdx.x), group(gr o668u | p ) , s| t ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~e p S| i tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_z e(stepSize_ == 0 ? n c668c | l S h m esmt.ecpoSmimz.eb(ufsftSeipzSeisz[eN_C C=L=_ P0R O?T On_cScIlMSPhLmEe]m/.NCcCoLm_mST.EbPuSf/fsSiizzeeosf[(NTC)C L:_ PsRtOeTpOS_iSzIeM_)P L{E ] | / ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~N C C| L group(group_ STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h | : 252 : 90 : note: Pin instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested herer imitivese,t r/i*cD, p1r>i,m s/ * D| i ^r ect=*/0, Proto,/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h :0565>: 5p:r inote: min instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested heres | ^ 565 | /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h :r565u:n5T:r enote: ein instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested hereU pDowno,S iCmOLLp_lUeN(,a rCgOsL)L;_ U N| R ^O LL>, COLL_UNROLL>(ar/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.hg:s203):;66 : | note: ^in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h : 203 : 66 : note: Rin instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested hereu nWorkElementR(e)d.Orpu,n (Awle)g;o ,| ^P roto, COLL_UNROLL/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp>:(7):.1:r unote: nin instantiation of member function 'RunWork, 0, 2, 2>::run' requested here( we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp :7 | 7D:E1F:I Nnote: Ein instantiation of member function 'RunWork, 0, 2, 2>::run' requested here_ ncclDevFunc(AllReduce _7T | RDEEEF_ISINMEP_LnEc_cPrleDMeuvlFSuunmc_(bAfl8l,R endcuccleF_uTnRcEAEl_lSRIeMdPuLcEe_,P FruenMcuPlrSeuMuml_Sbufm8, ,r cncclc_lbfFluonactA8l,l RNeCdCuLc_AeL,G OF_uTnRcEPEr, eNMCuClLS_uPmR,O TrOc_cSIlM_PbLfE)l o a| t^8 , NCCL_ALGO_TR/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.hE:E406,: 52N:C Cnote: expanded from macro 'DEFINE_ncclDevFunc'L _PROTO_SIM P406L | E ) R| u^n Worknote: , expanded from macro 'DEFINE_ncclDevFunc'a lgo, proto, 2>( ).406r | un ( & ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid),R nutnhrWeoardsk(hr,e aadlIgdox.,x )p, rgortouop,(g r2o>u(p)),. r | u ^~~~~~~~~~~~~~~~~n (&nc/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.hc:l667S:h60:m enote: mfield 'group' will be initialized after field 'stepSize'. work); \ | 667 ^ | tid(tid), nthr/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.he:a667d:s15(:nt hnote: refield 'nthreads' will be initialized after field 'tidInBlock'a ds), tidInBlock(thread I667dx | . x ) , gtrioudp((tgirdo)u,p) , n t| h ^~~~~~~~~~~r eads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf8, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf8, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf8, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_T/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.hREE:_667S:IM15P:L Ewarning: _initializer order does not match the declaration order [-Wreorder-ctor]P reMulSum_bf8, ncclFuncAllRedu c667e | , F u n ctPirde(MtuildS)u,m ,n trhcrcela_dbsf(lnotahtr8e,a dNsC)C,L _tAiLdGIOn_BTlRoEcEk,( tNhCrCeLa_dPIRdOxT.Ox_)S,I MgPrLoEu)p ( g| r^o up), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 668 | 406 | s t e p SRiuzneW(osrtkeh,m eaml.gcoo,m mp.rboutfof,S i2z>e(s)[.NrCuCnL(_&PnRcOcTlOS_hSmIeMmP.LwEo]r/kN)C;C L\_ S T| E ^P S/sizeof(T) : stepS/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.hi:z667e:_15): {note: field 'nthreads' will be initialized after field 'tidInBlock' | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 667 | tid(tid), nthreads(nthreads)/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h,: 301t:i90d:I nnote: Bin instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested herel ock(threadIdx.x )301, | g r o u p (Pgrriomuipt)i,v e s| < ^~~~~~~~~~~~~~~~~T , RedO/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.hp:,667 :F60a:n Anote: sfield 'group' will be initialized after field 'stepSize'y mmetric<1, N667C | C L _ M AtXi_dD(EtVi_dA)R,I TnYt>h,r e/a*dDsi(rnetchtr=e*a/d0s,) ,P rtoitdoI,n B0l>o cpkr(itmhsr e a| d ^I dx.x), group(/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.hg:r565o:u5p:) ,note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here | ^~~~~~~~~~~ 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf8, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf8, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf8, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf8, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadI), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSized_x.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf8, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf8, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf8, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf8, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf8, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf8, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf8, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf8, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_bf8, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEP/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_bf8, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ S/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf8, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf8, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_bf8, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_bf8, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_bf8, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_bf8, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_bf8, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1057:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 1057 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_PreMulSum_bf8, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_bf8, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_bf8, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf8, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_bf8, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_bf8, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_bf8, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 17 warnings generated when compiling for host. 17 warnings generated when compiling for gfx1102. /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_bf8, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(gIn file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) roup), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1057:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 1057 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_PreMulSum_bf8, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_bf8, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_bf8, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ [ 60%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_premulsum_f32.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60300 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_premulsum_f32.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_premulsum_f32.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_premulsum_f32.cpp.o -c /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_bf8, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f16, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f16, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f16, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit<__half, FuncPreMulSum, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_PreMulSum_f16, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInIn file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit<__half, FuncPreMulSum, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_PreMulSum_f16, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f16, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ Block(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f16, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit<__half, FuncPreMulSum, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_PreMulSum_f16, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(g/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.hroup), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1, 2>, 2>' requested here :667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f16, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f16, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit<__half, FuncPreMulSum, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_PreMulSum_f16, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit<__half, FuncPreMulSum, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_PreMulSum_f16, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f16, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f16, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit<__half, FuncPreMulSum, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_PreMulSum_f16, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f16, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f16, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.hIn file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ :370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f16, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f16, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f16, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f16, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f16, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f16, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f16, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f16, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f16, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f16, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f16, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f16, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h(:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | sgroup), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f16, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f16, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ tepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f16, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f16, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f16, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f16, ncc/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f16, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:lFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f16, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f16, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nth:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f16, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ reads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f16, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f16, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f16, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f16, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f16, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1057:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoLL128, 2>' requested here 1057 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_PreMulSum_f16, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f16, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f16, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f16, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for host. /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f16, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1057:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoLL128, 2>' requested here 1057 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_PreMulSum_f16, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2, 4>, 4>' requested here /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h558 | : 667 : 15 :r uwarning: ninitializer order does not match the declaration order [-Wreorder-ctor]R ing( ar667g | s ) ; | t ^i d(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f16, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f16, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f16, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f16, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f16, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f32, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f32, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f32, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f16, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f32, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x)In file included from ,/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f32, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), n group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f32, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ threads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_PreMulSum_f32, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f32, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_PreMulSum_f32, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f32, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_PreMulSum_f32, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_PreMulSum_f32, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_PreMulSum_f32, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f32, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_PreMulSum_f32, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f32, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f32, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f32, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f32, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f32, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | :667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f32, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f32, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f32, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f32, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x)/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h, group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f32, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f32, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f32, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f32, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f32, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f32, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f32, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f32, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f32, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement():.r667u:n15(:we )warning: ;initializer order does not match the declaration order [-Wreorder-ctor] | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 667 | tid(t i12d | )D,EF InNtEh_rneccaldDse(vnFtuhnrce(aAdllReduces_R)I,N Gt_iSdIIMnPBLlEo_cPkr(tehMruelaSumd_Ifd3x.2x,), nc cglrFouunpc(AglrloRuepd)u,c e ,| ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~Fu n | c tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_P reMulSum, f668 | l o at , NsCtCepLSi_AzLeG(Os_RtIeNpGS,iz e_N C=CL=_ P0R ?O TnOc_clSShImMePm.LcEo)m m .| b^uf fSizes[NCCL_/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.hPR:O406:TO52_:SI Mnote: Pexpanded from macro 'DEFINE_ncclDevFunc'L E]/NCCL_STEP S406/s | i z eo Rfu(nTWo) r: kst, algo, prot/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:o252,:90 :2 note: >in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here( ).run( &252n | c c l S h mPermim.iwtoirvke)s;< T\, R| e ^d Op, FanAsymme/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.ht:r667i:c<15N:C note: Cfield 'nthreads' will be initialized after field 'tidInBlock'L _MAX_DEV_ARITY ,667 | 1 >, / *tiDdi(rteicdt)=*,/ 0nt, hrePraodst(ont,h re0a>d sp)r,i mtsi d I| n ^B lock(th/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.hr:e565:a5:d note: Iin instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested hered x.x), 565g | r o urpu(ngTrroeuepU)p,Do wn , C O LtLid_(UtNRiOdL)L,> (nartghsre)a; d s| ( ^n threads/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:)203,: t66:i dnote: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested hereIn Bloc k203( | t h r e a d RIudnxW.orxkE)le, megnrto().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f32, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f32, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f32, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f32, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f32, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f32, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f32, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f32, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f32, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f32, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f32, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f32, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInB/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f32, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ lock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f32, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1057:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 1057 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_PreMulSum_f32, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f32, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f32, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f32, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1057:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 1057 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_PreMulSum_f32, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f32, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f32, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f32, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 17 warnings generated when compiling for gfx1200. 17 warnings generated when compiling for gfx1102. 17 warnings generated when compiling for gfx1201. 17 warnings generated when compiling for gfx1100. 17 warnings generated when compiling for gfx1101. 21 warnings generated when compiling for gfx90a. 21 warnings generated when compiling for gfx90a. [ 60%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_premulsum_f64.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60300 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_premulsum_f64.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_premulsum_f64.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_premulsum_f64.cpp.o -c /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.hIn file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssizIn file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ e_t size = args->count; | ^~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ :222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ :370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp :2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_PreMulSum_f64, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f64, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f64, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_PreMulSum_f64, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f64, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThre:421:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_PreMulSum_f64, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ ad((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthrea/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cppds-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_PreMulSum_f64, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ :2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f64, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f64, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f64, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_PreMulSum_f64, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f64, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_PreMulSum_f64, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f64, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f64, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f64, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] ), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWo 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f64, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ rkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f64, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f64, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f64, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f64, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f64, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f64, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f64, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f64, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PR:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f64, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ OTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f64, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f64, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f64, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f64, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f64, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f64, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f64, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f64, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? nc:667clShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f64, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f64, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f64, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f64, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f64, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f64, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f64, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f64, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f64, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f64, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f64, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1057:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 1057 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_PreMulSum_f64, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1057:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 1057 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_PreMulSum_f64, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f64, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ?/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] n 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stccepSize_) {lS | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f64, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ hmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f64, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f64, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f64, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f64, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f64, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 17 warnings generated when compiling for host. /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f64, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f64, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 17 warnings generated when compiling for gfx1201. 17 warnings generated when compiling for gfx1101. 17 warnings generated when compiling for gfx1102. 17 warnings generated when compiling for gfx1100. 17 warnings generated when compiling for gfx1102. 17 warnings generated when compiling for gfx1100. 17 warnings generated when compiling for gfx1200. 17 warnings generated when compiling for gfx1200. 17 warnings generated when compiling for gfx1201. 17 warnings generated when compiling for gfx1101. 21 warnings generated when compiling for gfx90a. 21 warnings generated when compiling for gfx90a. 21 warnings generated when compiling for gfx90a. [ 60%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_premulsum_f8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60300 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_premulsum_f8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_premulsum_f8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_premulsum_f8.cpp.o -c /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp 21 warnings generated when compiling for gfx90a. [ 60%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_premulsum_u32.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60300 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_premulsum_u32.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_premulsum_u32.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_premulsum_u32.cpp.o -c /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: In file included from warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uin/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2t32_t data1, flag1, data2, flag2; | ^~~~~ : In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t sizeIn file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | = ^~~~ args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ :222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f8, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, dataIn file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f8, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f8, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f8, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f8, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f8, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f8, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_PreMulSum_f8, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_PreMulSum_f8, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBloc/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f8, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ k(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f8, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_PreMulSum_f8, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f8, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_PreMulSum_f8, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_PreMulSum_f8, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWoIn file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f8, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ rkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_PreMulSum_f8, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f8, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f8, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f8, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f8, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f8, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f8, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f8, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f8, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f8, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ (tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f8, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f8, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f8, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f8, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f8, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f8, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f8, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f8, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffIn file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u32, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ Sizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u32, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f8, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f8, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_PreMulSum_u32, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), grou/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.hp(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement ( )t.irdu(nt(iwde)),; n t| h ^r eads(nthreads)/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp,: 7t:i1d:I nnote: Bin instantiation of member function 'RunWork, 0, 2, 2>::run' requested herel ock(threadIdx.x), gr o7u | pD(EgFrIoNuEp_)n,c c l| D ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~e v F| u tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_n c(AllReduce_TR E668E | _ S I M PsLtEe_pPSriezMeu(lsStuemp_Sui3z2e,_ n=c=c l0F u?n cnAclcllRSehdmuecme.,c oFmumn.cbPurffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f8, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ eMulSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_PreMulSum_u32, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f8, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f8, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f8, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_PreMulSum_u32, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u32, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u32, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f8, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u32, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u32, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_PreMulSum_u32, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_PreMulSum_u32, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u32, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u32, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f8, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u32, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1057:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 1057 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_PreMulSum_f8, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_PreMulSum_u32, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f8, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u32, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f8, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f8, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u32, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f8, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f8, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.hT) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u32, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f8, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f8, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u32, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u32, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRin/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u32, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ g(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u32, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u32, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1057:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 1057 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_PreMulSum_f8, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f8, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u32, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f8, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f8, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u32, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f8, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ OLL>, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u32, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u32, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f8, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u32, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u32, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u32, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u32, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u32, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u32, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u32, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u32, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u32, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.hIn file included from :667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u32, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u32, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u32, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u32, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u32, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u32, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u32, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u32, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1057:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 1057 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_PreMulSum_u32, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 17 warnings generated when compiling for host. /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.hIn file included from :667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreadIn file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from s(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSize/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1057:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 1057 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_PreMulSum_u32, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u32, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ s[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u32, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u32, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u32, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u32, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u32, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u32, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u32, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u32, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ n) { | ^~~~~ 17 warnings generated when compiling for gfx1101. 17 warnings generated when compiling for gfx1200. 17 warnings generated when compiling for gfx1100. 17 warnings generated when compiling for gfx1102. 17 warnings generated when compiling for gfx1201. 21 warnings generated when compiling for gfx90a. 21 warnings generated when compiling for gfx90a. [ 61%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_premulsum_u64.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60300 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_premulsum_u64.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_premulsum_u64.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_premulsum_u64.cpp.o -c /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uinIn file included from t32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.hIn file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ :222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.hIn file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ :370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOf/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ fset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ :222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u64, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_PreMulSum_u64, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u64, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(weIn file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u64, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_PreMulSum_u64, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u64, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_PreMulSum_u64, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u64, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u64, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u64, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u64, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_PreMulSum_u64, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_PreMulSum_u64, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u64, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u64, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u64, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_PreMulSum_u64, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.hIn file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u64, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u64, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u64, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.hPr:e667M:u15l:S uwarning: m_initializer order does not match the declaration order [-Wreorder-ctor]u 64, ncclFuncAllReduce, FuncPreMulSum ,667 | u i n t 6t4i_dt,( tNiCdC)L,_ AnLtGOh_rTeRaEdEs,( NnCtChLr_ePaRdOsTO)_,S ItMPiLdEI)n B | l^ ock(threa/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.hd:I406d:52x:. xnote: expanded from macro 'DEFINE_ncclDevFunc') , group(gro u406p | ) , | R ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~u n W| o tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_r k ,668 | a l g o ,s tperpoStioz,e (2step>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ Size_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u64, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u64, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u64, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u64, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u64, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u64, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u64, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u64, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u64, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u64, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u64, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u64, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u64, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u64, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u64, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u64, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u64, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u64, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u64, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u64, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u64, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadI/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.hdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u64, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u64, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u64, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u64, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1057:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 1057 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_PreMulSum_u64, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u64, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u64, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u64, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 17 warnings generated when compiling for host. /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u64, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1057:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 1057 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_PreMulSum_u64, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u64, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u64, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u64, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u64, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 21 warnings generated when compiling for gfx90a. 21 warnings generated when compiling for gfx90a. 17 warnings generated when compiling for gfx1101. 17 warnings generated when compiling for gfx1100. 17 warnings generated when compiling for gfx1200. 17 warnings generated when compiling for gfx1201. 17 warnings generated when compiling for gfx1102. 21 warnings generated when compiling for gfx90a. 21 warnings generated when compiling for gfx90a. [ 61%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_premulsum_u8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60300 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_premulsum_u8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_premulsum_u8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_premulsum_u8.cpp.o -c /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ 222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u8, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_PreMulSum_u8, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.hIn file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_PreMulSum_u8, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ :667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u8, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_PreMulSum_u8, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u8, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nIn file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: threads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u8, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u8, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u8, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u8, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u8, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u8, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u8, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u8, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u8, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u8, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u8, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u8, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u8, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_PreMulSum_u8, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u8, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_PreMulSum_u8, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u8, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_PreMulSum_u8, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u8, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u8, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u8, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadI667dx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u8, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u8, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u8, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u8, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u8, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u8, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u8, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1057:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 1057 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_PreMulSum_u8, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u8, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for host. /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u8, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u8, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthrtidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u8, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ eads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u8, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u8, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u8, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u8, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u8, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u8, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u8, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u8, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u8, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u8, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u8, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u8, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u8, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u8, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1057:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 1057 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_PreMulSum_u8, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u8, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13:In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u8, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 17 warnings generated when compiling for gfx1100. 17 warnings generated when compiling for gfx1101. 17 warnings generated when compiling for gfx1201. 17 warnings generated when compiling for gfx1200. 17 warnings generated when compiling for gfx1100. 17 warnings generated when compiling for gfx1201. 17 warnings generated when compiling for gfx1200. 17 warnings generated when compiling for gfx1101. 17 warnings generated when compiling for gfx1102. 17 warnings generated when compiling for gfx1102. [ 61%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_prod_bf16.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60300 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_prod_bf16.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_prod_bf16.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_prod_bf16.cpp.o -c /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp 21 warnings generated when compiling for gfx90a. 21 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ [ 61%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_prod_bf8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60300 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_prod_bf8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_prod_bf8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_prod_bf8.cpp.o -c /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf16, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Prod_bf16, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf16, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Prod_bf16, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Prod_bf16, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf16, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf16, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf16, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf16, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf16, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf16, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf16, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf16, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Prod_bf16, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf16, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Prod_bf16, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf16, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf16, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Prod_bf16, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf16, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buf/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFuncfSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf16, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ (AllReduce_TREE_SIMPLE_Prod_bf16, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf16, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf16, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf16, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf16, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf16, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf16, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf16, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_bf16, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf16, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf16, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_bf16, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf16, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf16, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf16, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h, 0, Proto, 0> pri:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tims | d ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RIN), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf16, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ G_SIMPLE_Prod_bf16, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_bf16, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_bf16, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf16, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_bf16, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf16, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1057:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 1057 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_Prod_bf16, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf16, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_bf16, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_bf16, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_bf16, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_bf16, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_bf16, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1057:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 1057 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_Prod_bf16, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_bf16, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_bf16, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_bf16, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_bf16, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_bf16, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | cIn file included from onst ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp2: :In file included from 2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:In file included from 10: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.hIn file included from :/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h10:: 168In file included from : /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:169:: 140:/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h14:: 270warning: :19unused variable 'data1' [-Wunused-variable]: warning: unused variable 'ptr' [-Wunused-variable] 140 | 270 | u i n t 3 2 _ t uidnatt6a41_,t *f lpatgr1 ,= draetcav2P,t rf(l0a)g+2l;l 1 2| 8 ^~~~~O ffset; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ :222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf8, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf8, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf8, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf8, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Prod_bf8, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Prod_bf8, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Prod_bf8, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Prod_bf8, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf8, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf8, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Prod_bf8, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Prod_bf8, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf8, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf8, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf8, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf8, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf8, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf8, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf8, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf8, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf8, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] :667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDow 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTn, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf8, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ O_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf8, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf8, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf8, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf8, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_bf8, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf8, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf8, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf8, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf8, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf8, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf8, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor].x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_bf8, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf8, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' k(thr 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ eadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_bf8, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf8, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_bf8, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads),/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf8, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_bf8, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf8, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf8, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, ProIn file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ to, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf8, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for host. /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_bf8, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_bf8, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_bf8, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1057:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 1057 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_Prod_bf8, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1057:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 1057 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_Prod_bf8, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_bf8, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_bf8, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_bf8, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_bf8, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_bf8, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_bf8, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_bf8, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_bf8, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 21 warnings generated when compiling for gfx90a. 21 warnings generated when compiling for gfx90a. 17 warnings generated when compiling for gfx1100. 17 warnings generated when compiling for gfx1101. 17 warnings generated when compiling for gfx1201. 17 warnings generated when compiling for gfx1102. 17 warnings generated when compiling for gfx1200. 21 warnings generated when compiling for gfx90a. 21 warnings generated when compiling for gfx90a. [ 62%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_prod_f16.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60300 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_prod_f16.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_prod_f16.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_prod_f16.cpp.o -c /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ :14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ 17 warnings generated when compiling for gfx1101. 17 warnings generated when compiling for gfx1102. 17 warnings generated when compiling for gfx1201. 17 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f16, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f16, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f16, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit<__half, FuncProd<__half>, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Prod_f16, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit<__half, FuncProd<__half>, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Prod_f16, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit<__half, FuncProd<__half>, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Prod_f16, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f16, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit<__half, FuncProd<__half>, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Prod_f16, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f16, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBloIn file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit<__half, FuncProd<__half>, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Prod_f16, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ ck(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f16, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f16, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f16, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f16, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit<__half, FuncProd<__half>, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Prod_f16, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f16, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLEIn file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize__Prod_f16, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f16, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f16, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx1200. /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f16, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f16, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f16, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f16, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f16, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f16, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f16, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f16, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f16, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f16, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f16, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f16, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 1, COLL_UNROLL>, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f16, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f16, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f16, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f16, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f16, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f16, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f16, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f16, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(steIn file included from pSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f16, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:: 1note: : expanded from macro 'DEFINE_ncclDevFunc'In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h :40912 | : In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h : 126R: uIn file included from n/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.hW:o14r: kIn file included from , algo, proto, 4>(). r46u | ns(&tnactcilcS hlmoenmg. wloorgk2)i;( l\o n g| ^n ) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f16, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f16, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f16, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for host. /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f16, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.hIn file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1057:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoLL128, 2>' requested here 1057 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_Prod_f16, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h::667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f16, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1057:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoLL128, 2>' requested here 1057 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_Prod_f16, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f16, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f16, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f16, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f16, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f16, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f16, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f16, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f16, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f16, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 17 warnings generated when compiling for gfx1101. 17 warnings generated when compiling for gfx1100. 21 warnings generated when compiling for gfx90a. 17 warnings generated when compiling for gfx1201. 17 warnings generated when compiling for gfx1200. 21 warnings generated when compiling for gfx90a. [ 62%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_prod_f32.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60300 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_prod_f32.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_prod_f32.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_prod_f32.cpp.o -c /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp 17 warnings generated when compiling for gfx1102. [ 62%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_prod_f64.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60300 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_prod_f64.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_prod_f64.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_prod_f64.cpp.o -c /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128OffIn file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ set; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->cIn file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ ount; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15:In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ 371 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flagIn file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const sIn file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ size_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f64, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Prod_f64, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f32, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Prod_f64, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f32, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Prod_f64, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Prod_f32, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f64, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Prod_f32, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f64, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Prod_f32, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f32, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f32, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Prod_f64, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ COLL_UNROLL>, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f64, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f32, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Prod_f64, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f32, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Prod_f32, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f64, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Prod_f64, ncclFuncAllReduce, FuncProd, double, NCIn file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ : In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ reads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f64, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f32, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Prod_f32, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f32, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f64, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_ST 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Prod_f32, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ EPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f64, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f32, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclD/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_e) v{F un c| ( ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~A l l| R group(groupe duce_TREE_SIMPLE_Prod_f32, ncclFuncAll/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.hR:e252d:u90c:e ,note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested hereF uncProd, float ,252 | N C C L _ A LPGrOi_mTiRtEiEv,e sN,406 :/52*:D inote: rexpanded from macro 'DEFINE_ncclDevFunc'e ct=*/0, Proto, 4060 | > p r iRmusn W o| r ^k note: ,in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here algo, proto ,565 | 2 > ( ) .rruunnT(r&enecUcplDSohwmne15,: Cnote: Ofield 'nthreads' will be initialized after field 'tidInBlock'L L_UNROLL>(args )667; | | ^ tid(tid), nthreads(/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.hn:t203h:r66e:a dnote: sin instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here) , tidInBloc k203( | t h r e a d I d xR.uxn)W,o rgkrEoluepm(egnrto ( ) .triudn((twied));, n| t ^h reads(nthread/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpps:)7,: 1t:i dnote: Iin instantiation of member function 'RunWork, 0, 2, 4>::run' requested heren Block(threadIdx. x7) | ,D EgFrIoNuEp_(ngcrcoluDpe)v,F u n| c ^~~~~~~~~~~( AllReduce_TREE_SIMPLE_Prod_f64, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f32, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f32, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f32, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f64, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f32, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f64, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f64, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), ti/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.hd:InB667l:o15c:k (warning: tinitializer order does not match the declaration order [-Wreorder-ctor]h readIdx.x), gr 667o | u p ( g rtoiud(pt)i,d )| , ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~n | t tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_hr eads(nthreads), t668i | d In B l osctk(tehprSeiazdeI(dxs.tx)e,p Sgriozuep_( gr=ou=p )0, ?| ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ n cc| l tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ Shmem. c668o | m m . bsutfefpSSiziez(esste[pNSCiCzLe__ P=R=O T0O _S?I MnPccLlES]h/mNeCmC.Lc_oSmTmE.PbSu/fsfizSeiozfe(sT[)N C:C Ls_tPROeTpO_SSiIMzPeL_E)]/ N{C C L| _S ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~T E PS| /s group(groupi zeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Pri/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.hm:i301t:i90v:e snote: , FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested hereT , RedOp, FanA symmetric<1, NCCL_MAX_DEV_ARITY>, /*Direct301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f32, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f64, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f64, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' :667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f32, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ OLL_UNROLL>().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f32, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f32, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f64, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEP:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), S/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | Run | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f32, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ WorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f64, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f32, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f64, ncclFun/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f32, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15cAllReduce, FuncProd, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ : warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f32, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f64, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.co/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f64, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 mm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f32, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f32, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ .buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f64, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f64, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f64, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f32, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f64, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f32, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f64, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f32, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f64, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f64, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f32, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f32, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f64, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f32, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1057:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 1057 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_Prod_f64, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f64, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f64, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem./builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f32, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f64, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f32, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f32, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkE/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f32, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ lement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f64, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args);/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f64, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f64, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1057:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 1057 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_Prod_f32, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f64, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f32, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f32, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f32, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1057:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 1057 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_Prod_f64, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f64, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f32, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f64, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runIn file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f64, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f32, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ Ring(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f32, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f32, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15:In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1057:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 1057 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_Prod_f32, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f64, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f64, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f64, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 17 warnings generated when compiling for host. /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f64, ncclFuncAllReduce, FuncP/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f32, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ rod, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); :667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f32, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f32, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f32, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f64, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f64, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f32, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f64, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f32, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f64, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f32, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 17 warnings generated when compiling for host. /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f64, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f64, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f64, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 21 warnings generated when compiling for gfx90a. 21 warnings generated when compiling for gfx90a. 17 warnings generated when compiling for gfx1200. 17 warnings generated when compiling for gfx1201. 17 warnings generated when compiling for gfx1100. 17 warnings generated when compiling for gfx1101. 17 warnings generated when compiling for gfx1102. 17 warnings generated when compiling for gfx1100. 17 warnings generated when compiling for gfx1101. 17 warnings generated when compiling for gfx1201. 17 warnings generated when compiling for gfx1102. 17 warnings generated when compiling for gfx1200. [ 63%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_prod_f8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60300 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_prod_f8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_prod_f8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_prod_f8.cpp.o -c /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp 21 warnings generated when compiling for gfx90a. 21 warnings generated when compiling for gfx90a. [ 63%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_prod_u32.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60300 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_prod_u32.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_prod_u32.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_prod_u32.cpp.o -c /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp 17 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ 17 warnings generated when compiling for gfx1101. 17 warnings generated when compiling for gfx1200. 17 warnings generated when compiling for gfx1100. 17 warnings generated when compiling for gfx1102. 17 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recIn file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ vPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ 17 warnings generated when compiling for gfx1101. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 17 warnings generated when compiling for gfx1200. /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 17 warnings generated when compiling for gfx1201. 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 17 warnings generated when compiling for gfx1201. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp :co2n: sIn file included from t/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h :s10s: In file included from i/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.hz:e_169t: s/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.hiz:e270 :=19 :ar gwarning: s-unused variable 'ptr' [-Wunused-variable]> count; | ^~~~270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: ^~~~ unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssi/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.hze_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int b:i370:19d: warning: unused variable 'size' [-Wunused-variable] =370 | cognst rssizie_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ dOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f8, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f8, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f8, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f8, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f8, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCIn file included from L_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f8, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Prod_f8, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Prod_f8, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Prod_f8, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f8, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f8, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f8, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f8, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f8, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.hIn file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ :370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f8, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Prod_f8, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f8, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Prod_f8, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f8, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f8, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.hIn file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ :667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f8, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Prod_f8, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.hSIMPLE_Prod_f8, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f8, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f8, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBloc/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f8, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ k(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f8, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f8, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTIn file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ O_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f8, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.hIn file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] In file included from 33 | /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp : 2 : cIn file included from o/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.hn:st10 : sIn file included from si/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.hz:e168_: t/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h s:i140ze: 14= : awarning: runused variable 'data1' [-Wunused-variable]g s->count; | ^~~~ 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ :140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f8, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.hIn file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = re:c667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ vPtr(0)+ll128In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ Offset; | ^~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f8, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidIIn file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f8, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.hIn file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ nBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: :in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f8, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52:222 note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f8, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f8, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f8, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f8, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f8, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f8, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f8, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), g/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f8, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ roup(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f8, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f8, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f8, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1057:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 1057 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_Prod_f8, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f8, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f8, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f8, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSi/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ ze_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f8, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f8, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for host. /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f8, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f8, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u32, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f8, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1057:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 1057 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_Prod_f8, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u32, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f8, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u32, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u32, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.hIn file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: :667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nIn file included from threads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u32, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u32, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Prod_u32, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Prod_u32, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u32, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Prod_u32, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f8, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u32, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u32, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u32, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u32, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.hIn file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Prod_u32, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ :667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Prod_u32, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u32, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Prod_u32, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u32, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u32, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u32, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u32, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u32, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u32, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u32, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 21 warnings generated when compiling for gfx90a. /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u32, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u32, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u32, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u32, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u32, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u32, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u32, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u32, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u32, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u32, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u32, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u32, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u32, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u32, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u32, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u32, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u32, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u32, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u32, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u32, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u32, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1057:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 1057 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_Prod_u32, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u32, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u32, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u32, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u32, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1057:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 1057 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_Prod_u32, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u32, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u32, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u32, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 17 warnings generated when compiling for host. /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u32, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 21 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ [ 63%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_prod_u64.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60300 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_prod_u64.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_prod_u64.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_prod_u64.cpp.o -c /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp 21 warnings generated when compiling for gfx90a. 21 warnings generated when compiling for gfx90a. [ 63%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_prod_u8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60300 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_prod_u8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_prod_u8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_prod_u8.cpp.o -c /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.hIn file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ :222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2;/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h: | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ ->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ :370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u64, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u64, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tIn file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u64, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ V_ARITY, 1>, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u64, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u64, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u64, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Prod_u64, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u64, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Prod_u64, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Prod_u64, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Prod_u64, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Prod_u64, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u64, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Prod_u64, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u64, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u64, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u64, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u64, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u64, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u64, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u64, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u64, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u64, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u64, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u64, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u64, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u64, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u64, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u64, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u64, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), ti/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.hdIn:Block(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Di667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u64, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ rect=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u8, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u64, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u64, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u64, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u64, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u64, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u64, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u64, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Prod_u8, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShIn file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Prod_u8, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWormkeC,L _aPlRgOoT,O _pSrIoMtPoL,E ]2/>N(C).CrLu_n(S&TnEcPcSlS/hsmiezme.owfo(rTk)) ;: \s t e| p ^S ize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u8, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u64, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u64, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u8, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Prod_u8, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u8, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u64, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u8, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u8, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u64, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u64, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u8, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u64, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u64, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u64, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Prod_u8, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Prod_u8, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizIn file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u8, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ es[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u8, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Prod_u8, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u8, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u64, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpI/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.hIn file included from :667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u8, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ nBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1057:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 1057 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_Prod_u64, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u8, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid):667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] , nthreads(nt h667r | e a d s )t,i dt(itdiIdn)B,l onctkh(rtehardesa(dnItdhxr.exa)d,s )g,r otuipd(IgnrBoluopc)k,( th r| e ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~a d I| d tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_x .x), group(group) , 668 | | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | s tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_t epSize(stepSize_ 668= | = 0 ?s tnecpcSliSzhem(esmt.ecpoSmimz.eb_u f=f=S i0z e?s [nNcCcClLS_hPmReOmT.Oc_oSmImM.PbLuEf]f/SNiCzCeLs_[SNTCECPLS_/PsRiOzTeOo_fS(ITM)P L:E ]s/tNeCpCSLi_zSeT_E)P S{/ s i| z ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e o f| ( group(groupT ) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h :62301 | : 90 : note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested hereP rimitives<,T ,0 ,R ePdrOopt,o ,F a0n>A spyrmimmest r i| c ^< 1, NCCL_MAX_DE/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.hV:_558A:R5I:T Ynote: >in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here, /*Direct =558* | / 0 , PrruontRoi,n g0<>T ,p rRiemdsO p ,| ^P roto, COLL_UNROLL/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h>:(565a:r5g:s )note: ;in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here | ^ 565 | runTr/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.he:e203U:p66D:o wnote: nin instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here< T, RedOp, Pro t203o | S i m p l e < 1 ,R u1n,W oCrOkLELl_eUmNeRnOtLn,, CTO,L LR_eUdNROOpL,L >A(lagrog,s )P;r o t| o ^, COLL_UNROLL>().run(/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.hw:e203):;66 : | note: ^in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp : 12 : 1 : note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested hereR unWorkElement(N)G._rSuInM(PwLeE)_;P r o| d ^_ u64, ncclFuncAl/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cppl:R7e:d1u:c enote: ,in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here FuncProd, uint64_ t7, | DNECFCILN_EA_LnGcOc_lRDIeNvGF,u nNcC(CALl_lPRReOdTuOc_eS_ITMRPELEE_)S I M| ^P LE_Prod_u8, nccl/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.hF:u409n:c52A:l lnote: Rexpanded from macro 'DEFINE_ncclDevFunc'e duce, FuncPr o409d | , u i nRtu8n_Wto,r kN_,P RaOlTgOo_,S IpMrPoLtEo), 4| >^( ).run(&ncclShme/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.hm:.406w:o52r:k )note: ;expanded from macro 'DEFINE_ncclDevFunc' \ | ^ 406 | Run/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.hW:o667r:k15<:c onote: lfield 'nthreads' will be initialized after field 'tidInBlock'l , ty, redop ,667 | a l g o ,t ipdr(ottiod,) ,2 >n(t)h.rreuand(s&(nnctchlrSehamdesm).,w otrikd)I;n B\l o c| k ^( threadIdx.x), g/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.hr:o667u:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ p(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1057:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 1057 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_Prod_u64, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h17 warnings generated when compiling for host. :667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u8, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u64, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u8, ncclFunc), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u8, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), groupAllReduce, FuncProd, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ (group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | s/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.htepSiz:409:52:e(stepSi note: expanded from macro 'DEFINE_ncclDevFunc'ze_ == 0 ? ncclShmem.comm.buffSizes[N 409 | CCL_PR RunWork, algo,CL_STE proto, 4>().ruPS/sizeof(n(&ncT) : stepSizclShme_) e{ | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u64, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.hm.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ :667:15:/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threa warning: initializer order does not match the declaration order [-Wreorder-ctor] dIdx.x), group(group), | ^~~~~~~~~~~ 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u64, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ .comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u8, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u8, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u64, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u8, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffS/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u8, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ izes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u8, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.hIn file included from :667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u8, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u8, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u8, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u8, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u8, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u64, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u8, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u8, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().r/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u64, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 1, COLL_UNROLL>, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u8, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc'u n(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid ), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor]/0, Pr oto, 0> 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ prims 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.hin instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u8, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u8, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u8, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u8, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRingz(ea_r)g s{) ; | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | ^| group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h203::30166::90 :note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested herenote: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 203 | 301 | RPurniWmoirtkiEvleesmI(T)Y.>r,u n/(*wDei)r;e c t| = ^* /0, Proto, 0>/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp :p12r:i1m:s note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here| ^ 12/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h | :D565E:F5I:N Enote: _in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested heren cclDevFunc( A565l | l R e d urcuen_TRrIeNeGU_pSDIoMwPnLt, 8COL_Lt_,U NNRCCOLL_LA>L(GarO_gRsI)N;G , | N ^CC L_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:note: 406in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here :52: note: expanded from macro 'DEFINE_ncclDevFunc' 203 | 406 | R uRnuWnoWrorkkE().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u8, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ll, ty, redop, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u8, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u8, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u8, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u8, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u8, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u8, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u8, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), w 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1057:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here arp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1057:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 1057 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_Prod_u8, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ 1057 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_Prod_u8, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u8, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u8, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), In file included from nt/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpph:r2e: aIn file included from d/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.hs:(10n: tIn file included from h/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.hr:e167a: ds/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h):,667 :t15i:d Iwarning: ninitializer order does not match the declaration order [-Wreorder-ctor]B lock(threadIdx.x), group(group) ,667 | | ^~~~~~~~~~~~~~~~~ tid/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h(:t667i:d60):, note: nfield 'group' will be initialized after field 'stepSize't hreads(nthre a667d | s ) , ttiiddI(ntBildo)c,k (ntthhrreeaaddIsd(xn.txh)r,e agdrso)u,p (tgirdoIunpB)l,o c k| ( ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~t h r| e tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_a dIdx.x), group(g r668o | u p ) , s t| e ^~~~~~~~~~~p Size(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u8, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u8, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 17 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u8, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u8, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 17 warnings generated when compiling for gfx1101. 17 warnings generated when compiling for gfx1102. 17 warnings generated when compiling for gfx1200. 17 warnings generated when compiling for gfx1100. 17 warnings generated when compiling for gfx1201. 21 warnings generated when compiling for gfx90a. 21 warnings generated when compiling for gfx90a. [ 64%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sum_bf16.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60300 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sum_bf16.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sum_bf16.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sum_bf16.cpp.o -c /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp 17 warnings generated when compiling for gfx1100. 17 warnings generated when compiling for gfx1201. 17 warnings generated when compiling for gfx1102. 17 warnings generated when compiling for gfx1200. 17 warnings generated when compiling for gfx1101. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; In file included from | ^~~~~ :222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ :19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf16, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf16, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf16, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf16, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf16, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf16, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301504 | | tPirdi(mtiitdi)v,e snd,/ W/A*RDPi_rSeIcZtE=)*,/ 0 ,| ~~~~~~~~~~~~~~~~~~P r o| t stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)o , 0> prim s505 | | ^ warpInBlock(threadIdx./builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.hx:/565W:A5R:P _note: Sin instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested hereI ZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 565 | runT r506e | e U p D ofwlna| , warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 COLL_UNROLL>( a507r | g s ) ; s t| e ^p Size(ncclShmem.comm./builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.hb:u203f:f66S:i znote: ein instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested heres [NCCL_PROTO _203L | L 1 2 8 ] / N C CRLu_nSWToErPkSE/lseimzeenotf<(Funi,n tT6,4 _Rte)d)O p{, A| l ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g o ,| group(groupP roto, COLL_UNROLL>().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Sum_bf16, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf16, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSizeIn file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf16, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ (stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_bf16, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf16, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Sum_bf16, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_bf16, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Sum_bf16, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf16, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 17 warnings generated when compiling for host. /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf16, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf16, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf16, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf16, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf16, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf16, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf16, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf16, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_bf16, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf16, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf16, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf16, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 21 warnings generated when compiling for gfx90a. /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf16, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf16, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf16, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Sum_bf16, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf16, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Sum_bf16, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_bf16, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(ui21nt64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Sum_bf16, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ warnings generated when compiling for gfx90a. /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf16, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf16, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf16, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf16, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_bf16, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_bf16, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_bf16, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ [ 64%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sum_bf8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60300 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sum_bf8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sum_bf8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sum_bf8.cpp.o -c /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp 17 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_bf16, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf16, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1057:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 1057 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_Sum_bf16, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_bf16, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_bf16, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_bf16, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_bf16, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf16, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx1200. /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_bf16, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_bf16, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf16, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 17 warnings generated when compiling for gfx1101. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1057:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 1057 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_Sum_bf16, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ 17 warnings generated when compiling for gfx1201. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_bf16, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads)17 warnings generated when compiling for gfx1102. , tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_bf16, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.hIn file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ :370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ :370:19: warning: unused variable 'size' [-Wunused-variable]In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->c 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ 371 | const int bid = gridOffset / channelCount; | ^~~ In file included from ount; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:s2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.hi:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | constz ssize_te size = ar=gs->co unt; a| ^~~~ rgs->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ :222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ 21 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf8, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Sum_bf8, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Sum_bf8, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Sum_bf8, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf8, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_S*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf8, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ IZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Sum_bf8, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf8, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Sum_bf8, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreadsIn file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf8, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ (nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf8, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf8, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Sum_bf8, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf8, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf8, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf8, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf8, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h21 warnings generated when compiling for gfx90a. :667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf8, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf8, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf8, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf8, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepS/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.hi:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Pze_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf8, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ rimitivesr,e a/d*s(Dnitrherceta=d*s/)0,, tPirdoItnoB,l o0c>k (ptrhirmesa dI d| x ^. x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 667 | tid(tid), n t565h | r e a d sr(unntThrreeeaUdpsD)o,w nt , COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf8, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf8, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf8, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf8, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf8, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_bf8, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1057:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 1057 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_Sum_bf8, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf8, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h667::40915::52 :warning: note: initializer order does not match the declaration order [-Wreorder-ctor]expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork< c667o | l l , ttyi,d (rteiddo)p,< tnyt>h,r eaaldgso(,n tphrroetaod,s )4,> (t)i.drIunnB(l&oncckc(ltShhrmeeamd.Iwodrxk.)x;) ,\ g r| o ^u p(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 668 | stepS i667z | e ( s t etpiSdi(ztei_d )=,= n0t h?re andcsc(lnSthhrmeeamd.sc)o,m mt.ibduIfnfBSliozceks([tNhCrCeLa_dPIRdOxT.Ox_)S,I MgPrLoEu]p/(NgCrCoLu_pS)T,E P S| / ^~~~~~~~~~~~~~~~~s izeof(/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.hT:)667 :60:: snote: tfield 'group' will be initialized after field 'stepSize'e pSize_) { 667 | | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ ti d(| ti group(groupd ), nthreads(nthreads), tidInBl/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.ho:c252k(:t90h:r enote: ain instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested hered Idx.x), group( gr252ou | p ) , P| r ^~~~~~~~~~~ imitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf8, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf8, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_bf8, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf8, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ [ 64%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sum_f16.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60300 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sum_f16.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sum_f16.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sum_f16.cpp.o -c /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf8, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_bf8, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf8, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf8, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf8, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf8, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_bf8, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf8, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_bf8, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.hIn file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ :667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_bf8, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for host. /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf8, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_bf8, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_bf8, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_bf8, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_bf8, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_bf8, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_bf8, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1057:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 1057 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_Sum_bf8, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_bf8, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_bf8, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_bf8, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_bf8, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 21 warnings generated when compiling for gfx90a. 21 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mIn file included from antissa; | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] In file included from 370 | const ssize_t si/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cppz:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:e10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: =unused variable 'data1' [-Wunused-variable] 140 | uinat32r_t datga1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ s->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = arIn file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ gs->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit<__half, FuncSum<__half>, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Sum_f16, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit<__half, FuncSum<__half>, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Sum_f16, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f16, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit<__half, FuncSum<__half>, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Sum_f16, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f16, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f16, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit<__half, FuncSum<__half>, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Sum_f16, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f16, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit<__half, FuncSum<__half>, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Sum_f16, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit<__half, FuncSum<__half>, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Sum_f16, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f16, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f16, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f16, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f16, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f16, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f16, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f16, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f16, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f16, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f16, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f16, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSiz/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f16, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ es[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f16, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f16, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f16, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f16, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1057:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoLL128, 2>' requested here 1057 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_Sum_f16, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f16, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f16, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f16, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f16, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f16, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f16, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f16, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_TREE, NC/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f16, ncclFuncAllReduce, FuncSum, half, CNCCL_ALGO_TREE, NCLCL__PPRROOTTOO__SSIIMMPPLLEE)) | | ^ ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52 :409 | note: expanded from macro 'DEFINE_ncclDevFunc' RunWorkW,o rakl (r)e.droupn<(t&yn>c,c lSahlmgeom,. wporrko)t;o ,\ 4 >| ( ^) .run(&ncclShmem./builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.hw:o667r:k15):; note: \field 'nthreads' will be initialized after field 'tidInBlock' | ^ 667 | tid(tid), n/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.ht:h667r:e15a:d snote: (field 'nthreads' will be initialized after field 'tidInBlock'n threads), tidInBl o667c | k ( t h rteiadd(Itdixd.)x,) ,n tghrroeuapd(sg(rnotuhpr)e,a d s| ) ^~~~~~~~~~~~~~~~~, tidIn/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.hB:l667o:c60k:( tnote: hfield 'group' will be initialized after field 'stepSize'r eadIdx.x), 667g | r o u p (tgirdo(utpi)d,) , | n ^~~~~~~~~~~~~~~~~t hreads/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h(:n667t:h60r:e anote: dfield 'group' will be initialized after field 'stepSize's ), tidInBlock (667t | h r e a dtIiddx(.txi)d, )g,r onutph(rgeraodusp()n,t h| r ^~~~~~~~~~~e ads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f16, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f16, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_TREE, NCCL_PR:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f16, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ OTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f16, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f16, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f16, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f16, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1057:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoLL128, 2>' requested here 1057 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_Sum_f16, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMP/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f16, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ LE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f16, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f16, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f16, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f16, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f16, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f16, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f16, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f16, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTIn file included from O_S/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cppI:MP1L: EIn file included from ]//builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.hN:17C: CIn file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.hL:_11S: TIn file included from E/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.hP:S12/: In file included from s/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.hi:z126e: oIn file included from f/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h(:T14): In file included from :/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h :s37t: eIn file included from p/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.hS:i14z: e/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h_): 46{: 13 :| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~warning: unused function 'log2i' [-Wunused-function]| group(group 46 | static long log2i(long n) /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h{: 62| : ^~~~~56 : note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f16, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f16, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f16, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f16, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 17 warnings generated when compiling for gfx1102. 17 warnings generated when compiling for gfx1101. 17 warnings generated when compiling for gfx1100. 17 warnings generated when compiling for gfx1201. 17 warnings generated when compiling for gfx1200. [ 64%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sum_f32.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60300 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sum_f32.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sum_f32.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sum_f32.cpp.o -c /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->cIn file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ ount; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.hIn file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ :222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64In file included from _t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.hIn file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ :370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Sum_f32, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f32, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Sum_f32, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Sum_f32, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f32, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f32, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f32, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f32, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f32, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Sum_f32, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f32, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f32, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCLIn file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f32, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>, nthreads(nthreads), ().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nwthreads(id(tnthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ id%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Sum_f32, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Sum_f32, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f32, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f32, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f32, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f32, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f32, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f32, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f32, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f32, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f32, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f32, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f32, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f32, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f32, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f32, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(gr/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f32, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ oup), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f32, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f32, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f32, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f32, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f32, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f32, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f32, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ Y>, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f32, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f32, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f32, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f32, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buf/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f32, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(grfSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f32, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ oup), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1057:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 1057 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_Sum_f32, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f32, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f32, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkEIn file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ lement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f32, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f32, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for host. /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f32, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f32, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f32, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1057:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 1057 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_Sum_f32, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f32, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f32, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeoIn file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ f(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f32, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f32, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 17 warnings generated when compiling for gfx1201. 17 warnings generated when compiling for gfx1101. 17 warnings generated when compiling for gfx1102. 17 warnings generated when compiling for gfx1200. 17 warnings generated when compiling for gfx1100. 21 warnings generated when compiling for gfx90a. 21 warnings generated when compiling for gfx90a. [ 65%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sum_f64.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60300 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sum_f64.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sum_f64.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sum_f64.cpp.o -c /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp 17 warnings generated when compiling for gfx1101. 17 warnings generated when compiling for gfx1200. 17 warnings generated when compiling for gfx1100. 17 warnings generated when compiling for gfx1201. 17 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ss:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ ize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ :15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->coIn file included from unt; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.hIn file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from :370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f64, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f64, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f64, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f64, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f64, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f64, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Sum_f64, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f64, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().ru: n(we); In file included from | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:7:1: note: :in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 710 | DEFIN: E_ncclIn file included from DevFunc/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h(AllRedu:ce_169TREE_S: IMPLE_/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.hSum_f64:, nccl506FuncAll:Reduce29, F:uncSum , doublwarning: e, NCCfield 'group' will be initialized after field 'stepSize' [-Wreorder-ctor]L_ALGO_T REE, N CCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 504406 | Run | Work, al go, protto, 2>().irun(&ncdclShmem.(work)t; \ | i ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ d), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Sum_f64, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f64, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Sum_f64, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Sum_f64, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f64, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h::29667:: 15warning: :field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid( t504i | d ) , nttihdr(etaidds)(,n tnhtrheraedasd)s,( nttihdrIenaBdlso)c,k (wtihdr(etaiddI%dWxA.RxP)_,S IgZrEo)u,p (wgarropu(pt)i,d / W| A ^~~~~~~~~~~~~~~~~R P_SIZ/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.hE:)667,: 60 :| ~~~~~~~~~~~~~~~~~~note: field 'group' will be initialized after field 'stepSize' | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 667505 | | twiadr(ptIindB)l,o cnkt(htrheraedasd(Indtxh.rxe/aWdAsR)P,_ StIiZdEI)n,B l o| c ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~k ( t| h warp(tid/WARP_SIZEr eadIdx.x) ,506 | g r o u pf(lgargoTuhpr)e,a d (| ( ^~~~~~~~~~~t id%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Sum_f64, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Sum_f64, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f64, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f64, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp :Ru2n: WoIn file included from rk/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h<:c10o: lIn file included from l/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h,: 167t: y, /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.hre:d667o:p15<:t ywarning: >,initializer order does not match the declaration order [-Wreorder-ctor] algo, proto, 2>().run(&ncclShmem.work); \ | ^ 667 | tid(t/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.hi:d667):,15 :n tnote: hfield 'nthreads' will be initialized after field 'tidInBlock'r eads(nthreads), ti667d | I n B l otcikd((tthrieda)d,I dnxt.hxr)e,a dgsr(onupt(hgrreoaudps)),, | t ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~i d I| n tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_B lock(threadIdx.x )668, | g r ou ps(tgerpoSuipz)e,( s| t ^~~~~~~~~~~~~~~~~e pSize_/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h :=667=: 600: ?note: field 'group' will be initialized after field 'stepSize'n cclShmem.comm .667b | u f f S itzieds([tNiCdC)L,_ PnRtOhTrOe_aSdIsM(PnLtEh]r/eNaCdCsL)_,S tTiEdPISn/Bsliozceko(ft(hTr)e a:d Isdtxe.pxS)i,z eg_r)o u{p ( g| r ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~o u p| ) group(group, | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f64, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f64, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f64, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f64, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f64, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ .comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f64, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f64, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f64, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f64, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f64, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f64, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | t/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f64, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ id(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f64, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f64, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f64, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f64, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f64, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 21 warnings generated when compiling for gfx90a. /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h::667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f64, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | 667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(RtunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f64, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f64, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ Idx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f64, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f64, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f64, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f64, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f64, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f64, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1057:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 1057 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_Sum_f64, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f64, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f64, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f64, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f64, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f64, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1057:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 1057 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_Sum_f64, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f64, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f64, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 21 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ : warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f64, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f64, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for host. /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f64, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ [ 65%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sum_f8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60300 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sum_f8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sum_f8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sum_f8.cpp.o -c /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t*In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count;In file included from | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ :222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ :370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f8, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f8, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f8, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f8, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f8, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f8, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f8, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f8, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f8, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidIIn file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Sum_f8, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ nBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f8, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Sum_f8, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Sum_f8, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f8, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f8, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ dOp, FanAsymmetric<1, NCCL_MAX_DEV_ARITY>, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f8, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f8, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f8, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f8, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tid:In667B:l15o:c kwarning: (initializer order does not match the declaration order [-Wreorder-ctor]t h readId667x | . x ) , tgirdo(utp(igdr)o,u pn)t,h r e| ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~a d s| ( tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_n t hrea668d | s) , t isdtIenpBSliozcek((sttherpeSaidzIed_x .=x=) ,0 g?r onucpc(lgSrhomuepm).,c o m| m ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~. b u| tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_f f Sizes668[N | C C L _ PsRtOeTpOS_iSzIeM(PsLtEe]p/SNiCzCeL__ S=T=E P0S /?s iznecocfl(STh)m e:m .sctoempmS.biuzfef_S)i z{e s [| N ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~C C L| _ group(groupP R/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.hOTO_SIMP:L62E:]56/:N Cnote: Cin instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested hereL _ STEP62S | / s i z ePorfi(Tm)i t:i ves, 0:,252 :P90r:o tnote: oin instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here, 0> pri252m | s | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h Primit:i558v:e5s:< Tnote: ,in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here R edOp558, | F an A sruynmRimnegtO,L L/>*(Dairrgesc)t;= * /| 0 ^, /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.hProto, :0203>:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f8, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f8, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Sum_f8, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Sum_f8, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f8, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f8, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Sum_f8, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f8, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f8, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitiv/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f8, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ es, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f8, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f8, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f8, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkEle:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ment().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f8, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f8, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f8, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f8, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f8, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f8, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f8, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f8, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 17 warnings generated when compiling for host. /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f8, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx1101. /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f8, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f8, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f8, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f8, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx1201. 17 warnings generated when compiling for gfx1100. 17 warnings generated when compiling for gfx1200. /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f8, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f8, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx1102. /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f8, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f8, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1057:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 1057 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_Sum_f8, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f8, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f8, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1057:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 1057 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_Sum_f8, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f8, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f8, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f8, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 21 warnings generated when compiling for gfx90a. 21 warnings generated when compiling for gfx90a. 21 warnings generated when compiling for gfx90a. 21 warnings generated when compiling for gfx90a. [ 65%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sum_u32.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60300 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sum_u32.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sum_u32.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sum_u32.cpp.o -c /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->counIn file included from t; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ :370:19: warning: unused variable 'size' [-Wunused-variable] 370 | constIn file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | const ssize_t size = args->countIn file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ ; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h ^~~~~ :370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u32, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u32, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u32, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Sum_u32, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Sum_u32, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u32, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Sum_u32, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Sum_u32, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLIn file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u32, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cppE_Sum_u32, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork2,: aIn file included from lg/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.ho:,10 : prIn file included from o/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.ht:o167,: 2/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h>(:)667.:r15u:n (warning: &ninitializer order does not match the declaration order [-Wreorder-ctor]c clShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h667: | 667 : 15 : tnote: ifield 'nthreads' will be initialized after field 'tidInBlock'd (tid), nthread s667( | n t h rteida(dtsi)d,) ,t indtIhnreBaldosc(kn(thtrheardesa)d,I dtxi.dIxn)B,l ogcrko(utphr(egardoIudpx.)x,) , | g ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~r ou p| ( tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_g roup), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667 :66860 | : note: field 'group' will be initialized after field 'stepSize' stepSize(s t667e | p S i z eti_d (=t=i d0) ,? ntnhcrcelaSdhsm(enmt.hcroemamd.sb)u,f ftSiidzIensB[lNoCcCkL(_tPhRrOeTaOd_ISdIxM.PxL)E, ]g/rNoCuCpL(grou_p)S,T E P| S ^~~~~~~~~~~/ sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u32, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u32, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Sum_u32, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u32, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Sum_u32, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u32, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u32, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^LL_UNROLL>().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u32, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u32, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx1101. /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u32, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u32, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u32, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u32, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u32, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u32, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u32, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u32, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u32, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u32, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u32, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u32, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u32, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx1201. /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u32, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u32, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u32, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u32, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u32, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u32, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u32, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u32, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u32, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | t/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u32, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u32, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ hreadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u32, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u32, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u32, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1057:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 1057 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_Sum_u32, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u32, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u32, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u32, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u32, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), w17 warnings generated when compiling for host. id(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1057:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 1057 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_Sum_u32, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u32, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx1200. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u32, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u32, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u32, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ [ 65%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sum_u64.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60300 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sum_u64.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sum_u64.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sum_u64.cpp.o -c /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 17 warnings generated when compiling for gfx1200. 17 warnings generated when compiling for gfx1201. 17 warnings generated when compiling for gfx1101. 17 warnings generated when compiling for gfx1100. 17 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1,In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ :222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ 21 warnings generated when compiling for gfx90a. 21 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u64, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ [ 66%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sum_u8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60300 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sum_u8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sum_u8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sum_u8.cpp.o -c /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u64, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u64, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work)/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u64, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ; \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Sum_u64, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Sum_u64, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Sum_u64, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Sum_u64, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Sum_u64, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffS/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tiizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Sum_u64, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ d), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u64, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u64, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u64, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u64, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u64, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u64, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u64, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), ntIn file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u64, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ hreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u64, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u64, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u64, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u64, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u64, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u64, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u64, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u64, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u64, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u64, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u64, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u64, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u64, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u64, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u64, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u64, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u64, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u64, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u64, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u64, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u64, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1057:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 1057 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_Sum_u64, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u64, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u64, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u64, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u64, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u64, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u64, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u64, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u64, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u64, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u64, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1057:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 1057 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_Sum_u64, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u64, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u64, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u64, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u64, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u64, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.hIn file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ uint64_t* ptr = recv:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ Ptr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size =/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u8, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u8, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u8, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Sum_u8, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u8, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Sum_u8, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u8, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u8, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Sum_u8, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u8, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u8, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u8, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u8, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u8, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u8, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u8, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Sum_u8, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u8, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Sum_u8, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u8, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Sum_u8, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u8, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u8, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u8, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u8, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u8, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u8, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u8, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u8, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u8, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u8, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nth/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u8, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ reads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u8, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u8, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u8, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | PrimitivesM,A Xa_lDgEoV,_ pArRoItToY,, 21>>(,) /.*rDuinr(e&cntc=c*l/S0h,m ePmr.owtoor,k )0;> \p ri m| s ^ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15 :565 | note: field 'nthreads' will be initialized after field 'tidInBlock' runTreeUpDownt,h rCeOaLdLs_)U,N RtOiLdLI>n(Balrogcsk)(;t h| r ^ eadIdx.x), group(g/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.hr:o203u:p66):, note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here| ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h :203667 | : 60 : note: field 'group' will be initialized after field 'stepSize' RunWorkElemen t667< | Fn , T ,t iRde(dtOipd,) ,A lngtoh,r ePardost(on,t hCrOeLaLd_sU)N,R OtiLdLI>n(B)l.orcukn((twher)e;a d I| d ^x .x), group(group/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp):,7 : 1| : ^~~~~~~~~~~ note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u8, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u8, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u8, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u8, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u8, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u8, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u8, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u8, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u8, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u8, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1057:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 1057 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_Sum_u8, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u8, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u8, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u8, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1057:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 1057 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_Sum_u8, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u8, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u8, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u8, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u8, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u8, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for host. /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u8, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 17 warnings generated when compiling for gfx1101. 17 warnings generated when compiling for gfx1102. 17 warnings generated when compiling for gfx1100. 17 warnings generated when compiling for gfx1201. 17 warnings generated when compiling for gfx1200. 21 warnings generated when compiling for gfx90a. 21 warnings generated when compiling for gfx90a. [ 66%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60300 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp.o -c /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp 17 warnings generated when compiling for gfx1100. 17 warnings generated when compiling for gfx1101. 17 warnings generated when compiling for gfx1200. 17 warnings generated when compiling for gfx1201. 17 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ 21 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ 21 warnings generated when compiling for gfx90a. /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_SumPostDiv_i32, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_SumPostDiv_i32, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i32, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_SumPostDiv_i32, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i32, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_SumPostDiv_i32, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_SumPostDiv_i32, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i32, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_SumPostDiv_i32, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i32, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i32, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i32, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i32, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(21args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i32, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ warnings generated when compiling for gfx90a. /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i32, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NC:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthrCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i32, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ eads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i32, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i32, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i32, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i32, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i32, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i32, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i32, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i32, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i32, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i32, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i32, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nt/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllRehreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i32, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ duce_TREE_SIMPLE_SumPostDiv_i32, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i32, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i32, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i32, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i32, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i32, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i32, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i32, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i32, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i32, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i32, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i32, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1057:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 1057 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_SumPostDiv_i32, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSiz/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i32, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ e(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i32, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i32, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i32, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i32, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1057:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 1057 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_SumPostDiv_i32, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i32, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i32, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i32, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i32, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i32, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for host. /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i32, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 21 warnings generated when compiling for gfx90a. /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i32, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.:126: x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: 203 | RunWorkElement(warning: ).run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i32, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i32, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ [ 66%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60300 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp.o -c /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 17 warnings generated when compiling for gfx1100. 17 warnings generated when compiling for gfx1200. 17 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ 17 warnings generated when compiling for gfx1201. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ 17 warnings generated when compiling for gfx1101. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | con/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ st ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ 17 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i64, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i64, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i64, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i64, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i64, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i64, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i64, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i64, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_SumPostDiv_i64, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i64, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_SumPostDiv_i64, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i64, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_SumPostDiv_i64, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i64, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i64, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i64, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i64, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx1100. /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i64, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_SumPostDiv_i64, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ 17 warnings generated when compiling for gfx1101. /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.hIn file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here :667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(n5t | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_SumPostDiv_i64, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ hreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i64, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_SumPostDiv_i64, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i64, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i64, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i64, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i64, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i64, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i64, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i64, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i64, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i64, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i64, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i64, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_TREE, NCCL_PROTO17 warnings generated when compiling for gfx1200. _SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i64, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i64, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i64, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i64, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i64, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i64, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i64, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSi/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i64, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ TY>, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i64, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i64, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i64, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i64, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i64, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 17 warnings generated when compiling for gfx1201. /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i64, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), gr/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.houp(group), :667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i64, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1057:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 1057 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_SumPostDiv_i64, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i64, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIIn file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ ZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1057:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 1057 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_SumPostDiv_i64, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i64, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i64, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i64, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ [ 66%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60300 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp.o -c /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i64, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i64, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 21 warnings generated when compiling for gfx90a. 21 warnings generated when compiling for gfx90a. [ 67%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60300 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp.o -c /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ :2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from :222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_tIn file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffse/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size t / channelCount; | ^~~ = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | cons/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ t int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i8, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i8, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i8, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i8, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i8, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i8, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i8, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i8, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i8, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i8, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i8, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i8, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i8, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i8, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i8, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement:(10): .rIn file included from u/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:n169(: w/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.he:)506;: 29: | warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:7 :5041 | : note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here tid(tid), nthreads(nt h7r | eDaEdFsI)N,E _wnicdc(ltDiedv%FWuAnRcP(_ASlIlZREe)d,u cwea_rTpR(EtEi_d/SWIAMRPPL_ES_ISZuEm),P o s| t ~~~~~~~~~~~~~~~~~~D i v| _ stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)i 8, ncclFunc A505l | l R e d uwcaer,p IFnuBnlcoScukm(PtohsrteDaidvI,d xi.nxt/8W_AtR,P _NSCICZLE_)A,L G O| _ ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~T R | E warp(tid/WARP_SIZEE , NCCL_PR O506T | O _ S I MfPlLaEg)T h r| e^a d((tid%4)==3), g/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.hr:o406u:p52(:g rnote: oexpanded from macro 'DEFINE_ncclDevFunc'u p), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 406 | RunWork <507c | o l l , sttye,p Sriezdeo(pnS,h maelmg.oc,o mpmr.obtuof,f S2i>z(e)s.[rNuCnC(L&_nPcRcOlTSOh_mLeLm1.2w8or]k/)N;C C\ L _| S ^T EPS/sizeof(uint64_t)) /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h{: 667 :| 15 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~: note: | field 'nthreads' will be initialized after field 'tidInBlock' group(group 667 | tid(tid), /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: nin instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested heret hread s421( | n t h r e a d s )p,r itmisd(ItniBdl,o cnkt(htrheraedasd,I dtxr.exe)-,> dgorwonu,p (tgrreouep-)>, d o| w ^~~~~~~~~~~~~~~~~n , args-/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h>:667s:e60n:d bnote: ufield 'group' will be initialized after field 'stepSize'f f, args->rec v667b | u f f , tairdg(st-i>dr)ed,O npthrAeragd)s; ( | nt ^ hreads),/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h :1065t:i5d: Inote: nin instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested hereB lock 1065( | t h rreuandTrIeedSpxli.tx(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_SumPostDiv_i8, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i8, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_SumPostDiv_i8, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_SumPostDiv_i8, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), wa/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comrp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_SumPostDiv_i8, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ m.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i8, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_SumPostDiv_i8, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i8, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_SumPostDiv_i8, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i8, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: 667initializer order does not match the declaration order [-Wreorder-ctor] | tid(tid), nthreads(nthrea d667s | ) , t itdiIdn(Btliodc)k,( tnhtrheraedaIddsx(.nxt)h,r egardosu)p,( gtrioduIpn)B,l o c| k ^~~~~~~~~~~~~~~~~( threadI/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.hd:x667.:x60):, note: gfield 'group' will be initialized after field 'stepSize'r oup(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~667 | | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ tid(tid), nt h668r | e a d s (snttehprSeiazdes()s,t etpiSdiIzneB_l o=c=k (0t h?r enacdcIldSxh.mxe)m,. cgormomu.pb(ugfrfoSuipz)e,s [ N| C ^~~~~~~~~~~C L_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i8, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i8, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i8, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i8, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i8, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i8, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i8, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i8, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i8, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i8, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i8, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i8, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i8, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.hIn file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ :667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i8, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i8, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_In file included from SI/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cppM:P2L: E)/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h : 33| :^19 : warning: unused variable 'size' [-Wunused-variable] /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc'33 | const 409s | s i z e _Rtu nsWiozrek <=c oalrlg,s -t>yc,o urnetd;o p <| t ^~~~y >, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5:/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i8, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SI/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ MP/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.hL:E558]:/5N:C Cnote: Lin instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here_ STEPS/siz e558o | f ( T ) r:u nsRtienpgS(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: 203in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here | RunWorkEle m252e | n t < F n , PTr,i mRietdiOvpe,s <(N)C.CrLu_nM(AwXe_)D;E V _| A ^R ITY, 1>, /*Dire/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cppc:t12=:*1/:0 ,note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested hereP roto, 0> prims | ^12 | DEFIN/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.hE:_565n:c5cl:D enote: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested herev Func( A565l | l Re d urucneT_rReIeNUGp_DSoIwMnP,c SCOuLLm_PUoNsRtODLLi>(args); | ^ v, /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.hi:n203t:866_:t ,note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i8, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i8, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ dOp, FanAsymmetric, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i8, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.hIn file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ :222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i8, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i8, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i8, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i8, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1057:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 1057 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_SumPostDiv_i8, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1057:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 1057 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_SumPostDiv_i8, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i8, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i8, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i8, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i8, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u32, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u32, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_SumPostDiv_u32, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u32, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_SumPostDiv_u32, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_SumPostDiv_u32, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u32, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u32, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u32, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u32, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_SumPostDiv_u32, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u32, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_SumPostDiv_u32, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_SumPostDiv_u32, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u32, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u32, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u32, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | PrimIn file included from i/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] tives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u32, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u32, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u32, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u32, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u32, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u32, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u32, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u32, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u32, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u32, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u32, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u32, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u32, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u32, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u32, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u32, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u32, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u32, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u32, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u32, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u32, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u32, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u32, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u32, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u32, ncclFuncAllReduce, FuncSumPostDiv, uint32/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u32, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u32, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u32, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u32, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1057:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 1057 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_SumPostDiv_u32, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u32, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u32, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1057:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 1057 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_SumPostDiv_u32, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u32, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u32, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u32, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for host. /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u32, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u32, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u32, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 17 warnings generated when compiling for gfx1200. 17 warnings generated when compiling for gfx1100. 17 warnings generated when compiling for gfx1102. 17 warnings generated when compiling for gfx1101. 17 warnings generated when compiling for gfx1201. 21 warnings generated when compiling for gfx90a. 21 warnings generated when compiling for gfx90a. [ 67%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60300 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp.o -c /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp 17 warnings generated when compiling for gfx1101. 17 warnings generated when compiling for gfx1102. 17 warnings generated when compiling for gfx1100. 17 warnings generated when compiling for gfx1200. 17 warnings generated when compiling for gfx1201. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:1: In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.hIn file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ :222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ :370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_SumPostDiv_u64, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_SumPostDiv_u64, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_SumPostDiv_u64, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u64, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u64, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u64, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u64, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u64, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u64, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u64, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u64, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 21 warnings generated when compiling for gfx90a. /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u64, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u64, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u64, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u64, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u64, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u64, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u64, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u64, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_SumPostDiv_u64, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBIn file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10l: ock(tIn file included from hreadI/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.hdx.x):, gro169up(gro: up), /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h| ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u64, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:20356 | : note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here RunWo r62k | E l e m ePnrti, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1057:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 1057 | runRing,( aPrrgost)o;, C| O ^L L_UNROLL>().run(w/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.he:)203;: 66 :| ^note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement, 0, 1, 2>::run' requested here Algo, Proto, COLL_UNR O5L | LD>E(F)I.NrEu_nn(cwcel)D;e v F| u ^n c(AllReduce_/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cppT:R10E:E1_:L Lnote: 1in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here2 8_SumPostDiv_u64, 10n | cDcElFFIuNnEc_AnlclcRleDdeuvcFeu,n cF(uAnllReduce_RING_LL128_SumPostDiv_u64, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ cSumPostDiv, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u64, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_SumPostDiv_u64, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u64, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u64, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSi/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.hz:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u64, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ e(stepSize_ == 0In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u64, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u64, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u64, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u64, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u64, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u64, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:1: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLLIn file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | s>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u64, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ tatic long log2i(long n) { | ^~~~~ 17 warnings generated when compiling for host. /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u64, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u64, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u64, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u64, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u64, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u64, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u64, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.com/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Pm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u64, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ rimitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u64, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u64, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u64, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u64, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u64, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u64, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u64, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u64, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 21 warnings generated when compiling for gfx90a. /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u64, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1057:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 1057 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_SumPostDiv_u64, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u64, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ [ 67%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60300 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp.o -c /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u64, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u64, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 17 warnings generated when compiling for gfx1100. 17 warnings generated when compiling for gfx1200. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ 17 warnings generated when compiling for gfx1102. 17 warnings generated when compiling for gfx1101. 17 warnings generated when compiling for gfx1201. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | cons/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cppt ssi:ze_t 2size =: args/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ :33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1,In file included from data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:222:19: warning: unused variable 'size' [-Wunused-variable] 222 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:370:19: warning: unused variable 'size' [-Wunused-variable] 370 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:371:15: warning: unused variable 'bid' [-Wunused-variable] 371 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_SumPostDiv_u8, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_SumPostDiv_u8, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_SumPostDiv_u8, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u8, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u8, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u8, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_SumPostDiv_u8, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_SumPostDiv_u8, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u8, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreIn file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreadads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(grs(nthreads),oup), | ^~~~~~~~~~~ wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1065:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1065 | runTreeSplit(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_SumPostDiv_u8, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u8, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u8, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u8, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u8, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u8, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u8, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ */0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u8, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u8, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u8, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(t/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u8, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ hreadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u8, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u8, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u8, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u8, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u8, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u8, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u8, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u8, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u8, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u8, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u8, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u8, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u8, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u8, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u8, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u8, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1057:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 1057 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_SumPostDiv_u8, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u8, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u8, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u8, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u8, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:252:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 252 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u8, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60k(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? : note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u8, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u8, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:1057:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 1057 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_SumPostDiv_u8, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u8, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u8, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u8, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u8, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:301:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 301 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 0, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u8, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u8, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u8, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u8, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u8, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u8, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 17 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:62:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 62 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u8, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 21 warnings generated when compiling for gfx90a. 21 warnings generated when compiling for gfx90a. [ 67%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/alltoall_pivot_sum_i8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60300 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/alltoall_pivot_sum_i8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/alltoall_pivot_sum_i8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/alltoall_pivot_sum_i8.cpp.o -c /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp 17 warnings generated when compiling for gfx1102. 17 warnings generated when compiling for gfx1201. 17 warnings generated when compiling for gfx1101. 17 warnings generated when compiling for gfx1200. 17 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/alltoall_pivot.h:9: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/alltoall_pivot.h:9: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/alltoall_pivot.h:9: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/alltoall_pivot.h:9: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/alltoall_pivot.h:9: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/alltoall_pivot.h:9: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/alltoall_pivot.h:9: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/alltoall_pivot.h:9: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flaIn file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/alltoall_pivot.h:9: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ g2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/alltoall_pivot.h:9: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/alltoall_pivot.h:9: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/alltoall_pivot.h:9: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/alltoall_pivot.h:9: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/alltoall_pivot.h:9: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 21 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/alltoall_pivot.h:9: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/alltoall_pivot.h:9: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/alltoall_pivot.h:9: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/alltoall_pivot.h:35:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 35 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/alltoall_pivot.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 80 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:3:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 3 | DEFINE_ncclDevFunc(AllToAllPivot_RING_SIMPLE_Sum_i8, ncclFuncAllToAllPivot, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/alltoall_pivot.h:9: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/alltoall_pivot.h:35:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 35 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/alltoall_pivot.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 80 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:3:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 3 | DEFINE_ncclDevFunc(AllToAllPivot_RING_SIMPLE_Sum_i8, ncclFuncAllToAllPivot, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/alltoall_pivot.h:9: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/alltoall_pivot.h:35:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 35 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/alltoall_pivot.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 80 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:3:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 3 | DEFINE_ncclDevFunc(AllToAllPivot_RING_SIMPLE_Sum_i8, ncclFuncAllToAllPivot, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/alltoall_pivot.h:9: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/alltoall_pivot.h:35:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 35 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/alltoall_pivot.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 80 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:3:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 3 | DEFINE_ncclDevFunc(AllToAllPivot_RING_SIMPLE_Sum_i8, ncclFuncAllToAllPivot, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 21 warnings generated when compiling for gfx90a. /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/alltoall_pivot.h:35:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 35 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/alltoall_pivot.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 80 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:3:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 3 | DEFINE_ncclDevFunc(AllToAllPivot_RING_SIMPLE_Sum_i8, ncclFuncAllToAllPivot, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/alltoall_pivot.h:35:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 35 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/alltoall_pivot.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 80 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:3:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 3 | DEFINE_ncclDevFunc(AllToAllPivot_RING_SIMPLE_Sum_i8, ncclFuncAllToAllPivot, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/alltoall_pivot.h:35:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 35 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/alltoall_pivot.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 80 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:3:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 3 | DEFINE_ncclDevFunc(AllToAllPivot_RING_SIMPLE_Sum_i8, ncclFuncAllToAllPivot, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/alltoall_pivot.h:35:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 35 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/alltoall_pivot.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 80 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:3:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 3 | DEFINE_ncclDevFunc(AllToAllPivot_RING_SIMPLE_Sum_i8, ncclFuncAllToAllPivot, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/alltoall_pivot.h:9: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/alltoall_pivot.h:35:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 35 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/alltoall_pivot.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 80 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:3:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 3 | DEFINE_ncclDevFunc(AllToAllPivot_RING_SIMPLE_Sum_i8, ncclFuncAllToAllPivot, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/alltoall_pivot.h:9: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/alltoall_pivot.h:35:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 35 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/alltoall_pivot.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 80 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:3:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 3 | DEFINE_ncclDevFunc(AllToAllPivot_RING_SIMPLE_Sum_i8, ncclFuncAllToAllPivot, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/alltoall_pivot.h:9: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/alltoall_pivot.h:35:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 35 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/alltoall_pivot.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 80 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:3:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 3 | DEFINE_ncclDevFunc(AllToAllPivot_RING_SIMPLE_Sum_i8, ncclFuncAllToAllPivot, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(thr/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ eadIdx.x), group(group), | ^~~~~~~~~~~ [ 68%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/broadcast_sum_i8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60300 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/broadcast_sum_i8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/broadcast_sum_i8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/broadcast_sum_i8.cpp.o -c /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/alltoall_pivot.h:9: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/alltoall_pivot.h:35:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 35 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/alltoall_pivot.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 80 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:3:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 3 | DEFINE_ncclDevFunc(AllToAllPivot_RING_SIMPLE_Sum_i8, ncclFuncAllToAllPivot, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/alltoall_pivot.h:35:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 35 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/alltoall_pivot.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 80 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:3:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 3 | DEFINE_ncclDevFunc(AllToAllPivot_RING_SIMPLE_Sum_i8, ncclFuncAllToAllPivot, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/alltoall_pivot.h:35:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 35 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/alltoall_pivot.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 80 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:3:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 3 | DEFINE_ncclDevFunc(AllToAllPivot_RING_SIMPLE_Sum_i8, ncclFuncAllToAllPivot, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/alltoall_pivot.h:35:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 35 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/alltoall_pivot.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 80 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:3:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 3 | DEFINE_ncclDevFunc(AllToAllPivot_RING_SIMPLE_Sum_i8, ncclFuncAllToAllPivot, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/alltoall_pivot.h:35:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 35 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/alltoall_pivot.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 80 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:3:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 3 | DEFINE_ncclDevFunc(AllToAllPivot_RING_SIMPLE_Sum_i8, ncclFuncAllToAllPivot, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 9 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/broadcast.h:9: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/broadcast.h:9: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/broadcast.h:9: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/broadcast.h:9: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/broadcast.h:9: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/broadcast.h:9: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/broadcast.h:9: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/broadcast.h:9: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/broadcast.h:9: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/broadcast.h:9: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/broadcast.h:9: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/broadcast.h:9: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/broadcast.h:9: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/broadcast.h:9: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/broadcast.h:9: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/broadcast.h:9: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 9 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/broadcast.h:9: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/broadcast.h:58:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 58 | prims(tid, nthreads, &ring->prev, &ring->next, inputBuf, outputBuf, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/broadcast.h:95:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 95 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Broadcast_RING_SIMPLE_Sum_i8, ncclFuncBroadcast, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/broadcast.h:9: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/broadcast.h:58:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 58 | prims(tid, nthreads, &ring->prev, &ring->next, inputBuf, outputBuf, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/broadcast.h:95:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 95 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Broadcast_RING_SIMPLE_Sum_i8, ncclFuncBroadcast, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/broadcast.h:9: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/broadcast.h:58:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 58 | prims(tid, nthreads, &ring->prev, &ring->next, inputBuf, outputBuf, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/broadcast.h:95:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 95 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Broadcast_RING_SIMPLE_Sum_i8, ncclFuncBroadcast, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/broadcast.h:9: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/broadcast.h:58:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 58 | prims(tid, nthreads, &ring->prev, &ring->next, inputBuf, outputBuf, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/broadcast.h:95:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 95 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Broadcast_RING_SIMPLE_Sum_i8, ncclFuncBroadcast, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/broadcast.h:9: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/broadcast.h:58:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 58 | prims(tid, nthreads, &ring->prev, &ring->next, inputBuf, outputBuf, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/broadcast.h:95:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 95 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Broadcast_RING_SIMPLE_Sum_i8, ncclFuncBroadcast, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/broadcast.h:58:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 58 | prims(tid, nthreads, &ring->prev, &ring->next, inputBuf, outputBuf, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/broadcast.h:95:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 95 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Broadcast_RING_SIMPLE_Sum_i8, ncclFuncBroadcast, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/broadcast.h:58:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 58 | prims(tid, nthreads, &ring->prev, &ring->next, inputBuf, outputBuf, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/broadcast.h:109:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 109 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Broadcast_RING_LL128_Sum_i8, ncclFuncBroadcast, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/broadcast.h:9: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/broadcast.h:58:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 58 | prims(tid, nthreads, &ring->prev, &ring->next, inputBuf, outputBuf, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/broadcast.h:95:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 95 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Broadcast_RING_SIMPLE_Sum_i8, ncclFuncBroadcast, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/broadcast.h:9: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/broadcast.h:58:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 58 | prims(tid, nthreads, &ring->prev, &ring->next, inputBuf, outputBuf, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/broadcast.h:95:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 95 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Broadcast_RING_SIMPLE_Sum_i8, ncclFuncBroadcast, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/broadcast.h:58:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 58 | prims(tid, nthreads, &ring->prev, &ring->next, inputBuf, outputBuf, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/broadcast.h:109:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 109 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Broadcast_RING_LL128_Sum_i8, ncclFuncBroadcast, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ 9 warnings generated when compiling for gfx1200. /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/broadcast.h:58:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 58 | prims(tid, nthreads, &ring->prev, &ring->next, inputBuf, outputBuf, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/broadcast.h:95:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 95 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Broadcast_RING_SIMPLE_Sum_i8, ncclFuncBroadcast, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buff9 warnings generated when compiling for gfx1101. Sizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/broadcast.h:58:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 58 | prims(tid, nthreads, &ring->prev, &ring->next, inputBuf, outputBuf, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/broadcast.h:95:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 95 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Broadcast_RING_SIMPLE_Sum_i8, ncclFuncBroadcast, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/broadcast.h:9: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/broadcast.h:58:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 58 | prims(tid, nthreads, &ring->prev, &ring->next, inputBuf, outputBuf, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/broadcast.h:95:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 95 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Broadcast_RING_SIMPLE_Sum_i8, ncclFuncBroadcast, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/broadcast.h:58:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 58 | prims(tid, nthreads, &ring->prev, &ring->next, inputBuf, outputBuf, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/broadcast.h:95:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 95 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Broadcast_RING_SIMPLE_Sum_i8, ncclFuncBroadcast, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 9 warnings generated when compiling for gfx1201. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/broadcast.h:58:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 58 | prims(tid, nthreads, &ring->prev, &ring->next, inputBuf, outputBuf, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/broadcast.h:95:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 95 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Broadcast_RING_SIMPLE_Sum_i8, ncclFuncBroadcast, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 9 warnings generated when compiling for host. /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/broadcast.h:58:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 58 | prims(tid, nthreads, &ring->prev, &ring->next, inputBuf, outputBuf, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/broadcast.h:95:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 95 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Broadcast_RING_SIMPLE_Sum_i8, ncclFuncBroadcast, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/broadcast.h:58:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 58 | prims(tid, nthreads, &ring->prev, &ring->next, inputBuf, outputBuf, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/broadcast.h:95:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 95 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Broadcast_RING_SIMPLE_Sum_i8, ncclFuncBroadcast, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 9 warnings generated when compiling for gfx90a. /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/broadcast.h:58:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 58 | prims(tid, nthreads, &ring->prev, &ring->next, inputBuf, outputBuf, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/broadcast.h:95:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 95 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Broadcast_RING_SIMPLE_Sum_i8, ncclFuncBroadcast, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 17 warnings generated when compiling for gfx1101. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 9 warnings generated when compiling for gfx90a. 17 warnings generated when compiling for gfx1200. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 9 warnings generated when compiling for gfx1102. 17 warnings generated when compiling for gfx1102. [ 68%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/device_table.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60300 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/device_table.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/device_table.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/device_table.cpp.o -c /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/device_table.cpp 17 warnings generated when compiling for gfx1100. 17 warnings generated when compiling for gfx1201. 21 warnings generated when compiling for gfx90a. 21 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/device_table.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/device_table.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/device_table.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/device_table.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/device_table.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/device_table.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/device_table.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ [ 68%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/host_table.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60300 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/host_table.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/host_table.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/host_table.cpp.o -c /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/host_table.cpp In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/device_table.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ 1 warning generated when compiling for host. 1 warning generated when compiling for gfx1100. 1 warning generated when compiling for gfx1200. 1 warning generated when compiling for gfx90a. 1 warning generated when compiling for gfx1102. 1 warning generated when compiling for gfx1201. 1 warning generated when compiling for gfx1101. 1 warning generated when compiling for gfx90a. [ 68%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60300 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp.o -c /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/host_table.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/host_table.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/host_table.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/host_table.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/host_table.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/host_table.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ 9 warnings generated when compiling for gfx1201. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/host_table.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ 9 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/host_table.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ 1 warning generated when compiling for host. 9 warnings generated when compiling for gfx1100. 9 warnings generated when compiling for gfx1200. 9 warnings generated when compiling for gfx1101. 1 warning generated when compiling for gfx1100. 1 warning generated when compiling for gfx90a. 1 warning generated when compiling for gfx1201. 1 warning generated when compiling for gfx1102. 1 warning generated when compiling for gfx90a. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ 1 warning generated when compiling for gfx1101. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ 1 warning generated when compiling for gfx1200. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ [ 69%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60300 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp.o -c /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ 12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ 10 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.hIn file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ :140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 10 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ [ 69%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_f16.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60300 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_f16.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_f16.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_f16.cpp.o -c /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthIn file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ reads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ 9 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ 17 warnings generated when compiling for gfx1101. /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx1102. /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = re17 warnings generated when compiling for gfx1201. cvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp::1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h11: :13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] :37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46 :13: warning: unused function 'log2i' [-Wunused-function] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 46 | static long log2i(long n) { | ^~~~~ 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 9 warnings generated when compiling for host. 17 warnings generated when compiling for gfx1200. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 17 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMinMax<__half>, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, half, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMinMax<__half>, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, half, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMinMax<__half>, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, half, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMinMax<__half>, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, half, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMinMax<__half>, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, half, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMinMax<__half>, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, half, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMinMax<__half>, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, half, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMinMax<__half>, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, half, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMinMax<__half>, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, half, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMinMax<__half>, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, half, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMinMax<__half>, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, half, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMinMax<__half>, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, half, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMinMax<__half>, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, half, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMinMax<__half>, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, half, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMinMax<__half>, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, half, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMinMax<__half>, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, half, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 9 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 9 warnings generated when compiling for gfx1101. 9 warnings generated when compiling for gfx1102. 9 warnings generated when compiling for gfx1100. 9 warnings generated when compiling for gfx1201. 9 warnings generated when compiling for gfx1200. 21 warnings generated when compiling for gfx90a. 21 warnings generated when compiling for gfx90a. [ 69%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_f32.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60300 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_f32.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_f32.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_f32.cpp.o -c /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp 9 warnings generated when compiling for gfx90a. 9 warnings generated when compiling for gfx90a. [ 69%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_f64.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60300 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_f64.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_f64.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_f64.cpp.o -c /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp 9 warnings generated when compiling for gfx1200. 99 warnings generated when compiling for gfx1100. warnings generated when compiling for gfx1201. 9 warnings generated when compiling for gfx1101. 9 warnings generated when compiling for gfx1201. 9 warnings generated when compiling for gfx1102. 9 warnings generated when compiling for gfx1101. 9 warnings generated when compiling for gfx1100. 9 warnings generated when compiling for gfx1200. 9 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ 9 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ 9 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ [ 70%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_f8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60300 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_f8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_f8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_f8.cpp.o -c /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantisIn file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ sa; | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, float, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, float, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, float, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, float, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, float, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, float, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, float, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, float, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, float, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, float, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, float, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, float, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, float, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, float, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ 9 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, float, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, float, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 9 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 9 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ [ 70%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_i32.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60300 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_i32.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_i32.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_i32.cpp.o -c /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, double, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, double, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, double, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, double, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, double, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, double, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, double, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, double, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, double, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, double, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, double, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, double, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, double, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, double, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, double, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, double, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 9 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ :1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cppIn file included from :1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 9 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ 9 warnings generated when compiling for gfx1101. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 9 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ :35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flIn file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ ag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 9 warnings generated when compiling for gfx1102. 9 warnings generated when compiling for gfx90a. 9 warnings generated when compiling for gfx1201. /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int32_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int32_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] :506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | 504 tid(tid), nthreads(nthreads), | tid wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int32_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ (tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int32_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ 9 warnings generated when compiling for gfx1200. /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int32_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int32_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBl/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group ock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int32_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int32_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 9 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 9 warnings generated when compiling for gfx90a. [ 70%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_i64.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60300 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_i64.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_i64.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_i64.cpp.o -c /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int32_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int32_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int32_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int32_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int32_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int32_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int32_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int32_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 9 warnings generated when compiling for gfx1200. 9 warnings generated when compiling for gfx1201. 9 warnings generated when compiling for gfx90a. 9 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 9 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 9 warnings generated when compiling for gfx1101. 9 warnings generated when compiling for gfx90a. [ 70%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_i8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60300 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_i8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_i8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_i8.cpp.o -c /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ 9 warnings generated when compiling for gfx1201. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | 9 warnings generated when compiling for gfx1200. ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 9 warnings generated when compiling for gfx1102. 9 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ 99 warnings generated when compiling for gfx1100 warnings generated when compiling for gfx1102. . 9 warnings generated when compiling for gfx1200. 9 warnings generated when compiling for gfx1101. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int64_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int64_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int64_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int64_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int64_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int64_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 9 warnings generated when compiling for gfx1101. /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int64_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ MinMax, int64_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int64_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 9 warnings generated when compiling for gfx1201. /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int64_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int64_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int64_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int64_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int64_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int64_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int64_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 9 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 9 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 9 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ [ 71%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_u32.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60300 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_u32.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_u32.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_u32.cpp.o -c /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int8_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int8_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int8_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int8_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int8_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int8_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int8_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int8_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ _LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int8_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int8_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int8_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int8_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int8_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int8_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from 9/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int8_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int8_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 9 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 9 warnings generated when compiling for host. [ 71%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_u64.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60300 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_u64.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_u64.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_u64.cpp.o -c /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ 9 warnings generated when compiling for gfx1200. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 9 warnings generated when compiling for gfx90a. 9 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ 9 warnings generated when compiling for gfx1101. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 9 warnings generated when compiling for gfx1201. 9 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ 9 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ [ 71%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_u8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60300 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_u8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_u8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_u8.cpp.o -c /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint32_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint32_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint32_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint32_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint32_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint32_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint32_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint32_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint32_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint32_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint32_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint32_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint32_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint32_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint32_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint32_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 9 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint64_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint64_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint64_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint64_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint64_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint64_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint64_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint64_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint64_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint64_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint64_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint64_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint64_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint64_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint64_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint64_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 9 warnings generated when compiling for gfx1200. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 9 warnings generated when compiling for gfx1101. 9 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 9 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 9 warnings generated when compiling for gfx1201. 9 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ 9 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32In file included from _t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_tIn file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 9 warnings generated when compiling for gfx90a. /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint8_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint8_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ [ 71%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_bf16.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60300 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_bf16.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_bf16.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_bf16.cpp.o -c /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint8_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint8_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint8_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ 9 warnings generated when compiling for gfx1100. 9 warnings generated when compiling for gfx1101. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint8_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint8_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint8_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint8_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint8_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint8_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] : warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint8_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint8_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint8_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint8_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 9 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint8_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 9 warnings generated when compiling for gfx1200. 9 warnings generated when compiling for host. 9 warnings generated when compiling for gfx1201. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 9 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 9 warnings generated when compiling for gfx90a. [ 72%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_bf8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60300 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_bf8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_bf8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_bf8.cpp.o -c /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp 9 warnings generated when compiling for gfx1100. 9 warnings generated when compiling for gfx1101. 9 warnings generated when compiling for gfx1201. 9 warnings generated when compiling for gfx1102. 9 warnings generated when compiling for gfx1200. 9 warnings generated when compiling for gfx90a. 9 warnings generated when compiling for gfx90a. [ 72%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_f16.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60300 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_f16.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_f16.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_f16.cpp.o -c /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h: uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.hMSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ :506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 9 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ 9 warnings generated when compiling for gfx1102. /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h9 warnings generated when compiling for gfx1100. :220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 9 warnings generated when compiling for gfx1200. 9 warnings generated when compiling for gfx1201. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 9 warnings generated when compiling for gfx1101. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 9 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ :506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 9 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 9 warnings generated when compiling for gfx90a. 9 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ [ 72%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_f32.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60300 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_f32.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_f32.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_f32.cpp.o -c /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ 9 warnings generated when compiling for gfx1200. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ 9 warnings generated when compiling for gfx1101. 9 warnings generated when compiling for gfx1100. 9 warnings generated when compiling for gfx1102. 9 warnings generated when compiling for gfx1201. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 9 warnings generated when compiling for gfx1200. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 9 warnings generated when compiling for gfx1101. 9 warnings generated when compiling for gfx1100. 9 warnings generated when compiling for gfx1201. 9 warnings generated when compiling for gfx1200. 9 warnings generated when compiling for gfx1102. 9 warnings generated when compiling for gfx1201. /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ 9 warnings generated when compiling for gfx1100. /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(In file included from tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 9 warnings generated when compiling for gfx1102. /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ 9 warnings generated when compiling for gfx1101. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 9 warnings generated when compiling for host. 9 warnings generated when compiling for gfx90a. 9 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 9 warnings generated when compiling for gfx90a. 9 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ [ 72%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_f64.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60300 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_f64.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_f64.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_f64.cpp.o -c /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp [ 73%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_f8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60300 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_f8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_f8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_f8.cpp.o -c /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp 9 warnings generated when compiling for gfx90a. 9 warnings generated when compiling for gfx90a. [ 73%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_i32.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60300 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_i32.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_i32.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_i32.cpp.o -c /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ :18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ 9 warnings generated when compiling for gfx1201. 9 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:132, fl: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ ag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, fIn file included from lag1, d/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1a: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ ta2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ : warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from 9 warnings generated when compiling for gfx1102. /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 9 warnings generated when compiling for gfx1101. 9 warnings generated when compiling for gfx1200. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ 9 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.hPrimitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ :667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wi/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506d:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ (tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 9 warnings generated when compiling for gfx90a. /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PR:O506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] TO_LL1 504 | 28]/ tid(tiNCCL_STdEPS/si)zeo,f(uint64_t)) { nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group9 warnings) generated when compiling for host. , | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from [ 73%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_i64.cpp.o /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60300 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_i64.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_i64.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_i64.cpp.o -c /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.hIn file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInter:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), pntreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ reads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 9 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nth/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cppreads), wid(tid%WARP_SIZE), :warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cppSIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThrea:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:d((tid%4)=13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: =3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEwarning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(PS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 9 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 9 warnings generated when compiling for gfx1100. 9 warnings generated when compiling for gfx1201. 9 warnings generated when compiling for gfx1200. 9 warnings generated when compiling for gfx90a. 9 warnings generated when compiling for gfx1101. 9 warnings generated when compiling for gfx90a. 9 warnings generated when compiling for gfx1102. /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ [ 73%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_i8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60300 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_i8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_i8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_i8.cpp.o -c /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 9 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 9 warnings generated when compiling for gfx1102. 9 warnings generated when compiling for gfx1101. 9 warnings generated when compiling for gfx1100. 9 warnings generated when compiling for gfx1102. 9 warnings generated when compiling for gfx1201. 9 warnings generated when compiling for gfx1201. 9 warnings generated when compiling for gfx1200. 9 warnings generated when compiling for gfx1101. 9 warnings generated when compiling for gfx90a. 9 warnings generated when compiling for gfx90a. 9 warnings generated when compiling for gfx1100. 9 warnings generated when compiling for gfx1200. [ 74%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_u32.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60300 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_u32.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_u32.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_u32.cpp.o -c /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ 9 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, dataIn file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 9 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ [ 74%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_u64.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60300 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_u64.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_u64.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_u64.cpp.o -c /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp 9 warnings generated when compiling for gfx1201. 9 warnings generated when compiling for gfx1101. 9 warnings generated when compiling for gfx1100. 9 warnings generated when compiling for gfx1200. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(gr/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ oup), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp :2201 | : In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.hP:r12i: mIn file included from i/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.ht:i13v: e/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.hs:<76T:,18 :R ewarning: dunused variable 'y' [-Wunused-variable]O p, FanAsymmet r76i | c < 1 , 1 > , 1u,i nPtr3o2t_ot, y0,> hperaidm,s m a| n ^t issa; | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(n(nthreads), tidInBlock(threadIdx.x), group(groIn file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ up), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ threads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 9 warnings generated when compiling for gfx90a. 9 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 9 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ [ 74%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_u8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60300 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_u8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_u8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_u8.cpp.o -c /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ :506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreadIn file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] s), wid(tid%WARP_S 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group IZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 9 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13:In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSize/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ s[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 9 warnings generated when compiling for host. /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threa/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ dIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 9 warnings generated when compiling for gfx1101. /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVIn file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:REDOP_TYPE(Prod, uint8_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ 1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 9 warnings generated when compiling for gfx1100. /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 9 warnings generated when compiling for gfx1201. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 9 warnings generated when compiling for gfx1102. 9 warnings generated when compiling for host. /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 9 warnings generated when compiling for gfx1200. 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 9 warnings generated when compiling for gfx1101. 9 warnings generated when compiling for gfx1100. 9 warnings generated when compiling for gfx1200. 9 warnings generated when compiling for gfx1102. 9 warnings generated when compiling for gfx1201. 9 warnings generated when compiling for gfx90a. 9 warnings generated when compiling for gfx90a. 9 warnings generated when compiling for gfx90a. 9 warnings generated when compiling for gfx90a. [ 75%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_bf16.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60300 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_bf16.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_bf16.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_bf16.cpp.o -c /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp [ 75%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_bf8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60300 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_bf8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_bf8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_bf8.cpp.o -c /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp 9 warnings generated when compiling for gfx1201. 9 warnings generated when compiling for gfx1200. 9 warnings generated when compiling for gfx1102. 9 warnings generated when compiling for gfx1101. 9 warnings generated when compiling for gfx1100. 9 warnings generated when compiling for gfx90a. 9 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ [ 75%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_f16.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60300 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_f16.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_f16.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_f16.cpp.o -c /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 9 warnings generated when compiling for gfx1102. 9 warnings generated when compiling for gfx1101. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 9 warnings generated when compiling for gfx1200. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 9 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 9In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ warnings generated when compiling for gfx1201. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 9 warnings generated when compiling for host. /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group)9 warnings generated when compiling for gfx90a. , | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ 9/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.hIn file included from :/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | In file included from 9 stepSize(stepSize warnings generated when compiling for gfx90a. _ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ , 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ [ 75%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_f32.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60300 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_f32.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_f32.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_f32.cpp.o -c /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:nthrea3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, PrhotoLL128, fullreOps>a(comm, dalgo, work);s \ | ^ ), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 9 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 9 warnings generated when compiling for host. /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ ) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:3:/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ 1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 9 warnings generated when compiling for gfx1201. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 9 warnings generated when compiling for gfx1200. 99 warnings generated when compiling for gfx1101. warnings generated when compiling for gfx1100. 9 warnings generated when compiling for gfx1200. 9 warnings generated when compiling for gfx1102. 9 warnings generated when compiling for gfx1101. 9 warnings generated when compiling for gfx1100. 9 warnings generated when compiling for gfx1201. 9 warnings generated when compiling for gfx1102. 9 warnings generated when compiling for gfx1201. 9 warnings generated when compiling for gfx1102. 9 warnings generated when compiling for gfx1100. 9 warnings generated when compiling for gfx1200. 9 warnings generated when compiling for gfx1101. 9 warnings generated when compiling for gfx90a. 99 warnings generated when compiling for gfx90a. warnings generated when compiling for gfx90a. 9 warnings generated when compiling for gfx90a. 9 warnings generated when compiling for gfx90a. [ 76%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_f64.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60300 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_f64.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_f64.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_f64.cpp.o -c /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp [ 76%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_f8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60300 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_f8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_f8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_f8.cpp.o -c /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp 9 warnings generated when compiling for gfx90a. [ 76%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_i32.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60300 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_i32.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_i32.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_i32.cpp.o -c /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp 9 warnings generated when compiling for gfx1200. 9 warnings generated when compiling for gfx1101. 9 warnings generated when compiling for gfx1102. 9 warnings generated when compiling for gfx1100. 9 warnings generated when compiling for gfx1201. 9 warnings generated when compiling for gfx90a. 9 warnings generated when compiling for gfx90a. [ 76%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_i64.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60300 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_i64.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_i64.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_i64.cpp.o -c /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ 1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ 13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uintIn file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, dataIn file included from 2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | 32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.hIn file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ :140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+llIn file included from 128Offset; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] :506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, doubleIn file included from , f/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cppa:l1s: eIn file included from )/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h;: 13 : | In file included from ^/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h :167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:initializer order does not match the declaration order [-Wreorder-ctor]405 :3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclR u667n | I n t e rtpirde(tteird<)ty,p en, tFhurneca#d#sd(envrtehdroepa,t PidrIontBolLLo1c28k,( tfhulrelaOpds>I(dcxo.mxm), ,al ggroup(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: owarning: , workinitializer order does not match the declaration order [-Wreorder-ctor]); \ | ^ 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h_:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ DEVREDOP_TYPE(Sum, double, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | st/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.he:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ pSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/si/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ zeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested hereIn file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | 220 tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | | Primitives, 1, Proto, 0> prims | ^ mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int 667 | t3i2d_(tt,i df)a,l snet)hr;ea d s| (^n threads), tidInBlock(th/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.hr:e408a:d3I:d x.note: x)expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE', group(group), | ^~~~~~~~~~~~~~~~~ 408 | mscclRunInterpre/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.ht:e667r:<60t:y pnote: efield 'group' will be initialized after field 'stepSize', Func##devredop | , P rtoitd(otSiidm)p,l net , | f ^~~~~~~~~~~u llOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBloc9k(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 9 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 9 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 9 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 9 warnings generated when compiling for gfx1200. 9 warnings generated when compiling for gfx90a. 9 warnings generated when compiling for gfx1100. 9 warnings generated when compiling for gfx1201. 9 warnings generated when compiling for gfx1101. 9 warnings generated when compiling for gfx90a. 9 warnings generated when compiling for gfx1102. [ 77%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_i8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60300 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_i8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_i8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_i8.cpp.o -c /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp 9 warnings generated when compiling for gfx1200. 9 warnings generated when compiling for gfx1102. 9 warnings generated when compiling for gfx1100. 9 warnings generated when compiling for gfx1101. 9 warnings generated when compiling for gfx1201. 9 warnings generated when compiling for gfx90a. 9 warnings generated when compiling for gfx90a. [ 77%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_u32.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60300 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_u32.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_u32.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_u32.cpp.o -c /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp 9 warnings generated when compiling for gfx1201. 9 warnings generated when compiling for gfx1100. 9 warnings generated when compiling for gfx1101. 9 warnings generated when compiling for gfx1200. 9 warnings generated when compiling for gfx1102. 9 warnings generated when compiling for gfx1100. 99 warnings generated when compiling for gfx1101. warnings generated when compiling for gfx1102. 9 warnings generated when compiling for gfx1200. 9 warnings generated when compiling for gfx90a. 9 warnings generated when compiling for gfx1201. 9 warnings generated when compiling for gfx90a. [ 77%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_u64.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60300 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_u64.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_u64.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_u64.cpp.o -c /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ 9 warnings generated when compiling for gfx90a. 9 warnings generated when compiling for gfx90a. In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ :168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvIn file included from Pt/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cppr:(01): +In file included from ll/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h1:2138: OIn file included from f/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.hf:s169e: t/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h; :270 :| 19 ^~~: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ [ 77%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_u8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60300 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_u8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_u8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_u8.cpp.o -c /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx./builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uinIn file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ t32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:3:1In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ : note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 9 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 9 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ :1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptrIn file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSIn file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1ize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ : In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 9 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ :506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> primsIn file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclR/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | unInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(s667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tepSize_ = tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ = 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:220:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 220 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:408:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 408 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 9 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 9 warnings generated when compiling for gfx1200. 9 warnings generated when compiling for gfx1101. 9 warnings generated when compiling for gfx1200. 9 warnings generated when compiling for gfx1102. 9 warnings generated when compiling for gfx1102. 9 warnings generated when compiling for gfx1101. 9 warnings generated when compiling for gfx1100. 9 warnings generated when compiling for gfx1201. 9 warnings generated when compiling for gfx1201. 9 warnings generated when compiling for gfx1100. 9 warnings generated when compiling for gfx90a. 9 warnings generated when compiling for gfx90a. [ 78%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_minmax_bf16.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60300 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_minmax_bf16.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_minmax_bf16.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_minmax_bf16.cpp.o -c /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp 9 warnings generated when compiling for gfx90a. 9 warnings generated when compiling for gfx90a. [ 78%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_minmax_bf8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60300 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_minmax_bf8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_minmax_bf8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_minmax_bf8.cpp.o -c /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp 9 warnings generated when compiling for gfx1200. 9 warnings generated when compiling for gfx1201. 9 warnings generated when compiling for gfx1102. 9 warnings generated when compiling for gfx1101. 9 warnings generated when compiling for gfx1100. 9 warnings generated when compiling for gfx90a. 9 warnings generated when compiling for gfx90a. [ 78%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_minmax_f16.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60300 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_minmax_f16.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_minmax_f16.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_minmax_f16.cpp.o -c /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp 9 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ 9 warnings generated when compiling for gfx1200. 9 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ 9 warnings generated when compiling for gfx1201. 9 warnings generated when compiling for gfx1101. 9 warnings generated when compiling for gfx90a. 9 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19:/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ [ 78%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_minmax_f32.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60300 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_minmax_f32.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_minmax_f32.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_minmax_f32.cpp.o -c /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_bf16, ncclFuncReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_bf16, ncclFuncReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 77 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_MinMax_bf16, ncclFuncReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_bf16, ncclFuncReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_bf16, ncclFuncReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_bf16, ncclFuncReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_bf16, ncclFuncReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 77 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_MinMax_bf16, ncclFuncReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_bf16, ncclFuncReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_bf16, ncclFuncReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_bf16, ncclFuncReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_bf16, ncclFuncReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_bf16, ncclFuncReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &rIn file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ ing->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_bf16, ncclFuncReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_bf16, ncclFuncReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 9 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_bf16, ncclFuncReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ : warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_bf16, ncclFuncReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_bf16, ncclFuncReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_bf8, ncclFuncReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, fla 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_bf8, ncclFuncReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ g1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_bf8, ncclFuncReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 77 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_MinMax_bf8, ncclFuncReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBl/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cppo:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.hc:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.hk:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:(140:14:t warning: unused variable 'data1' [-Wunused-variable] In file included from 140 | uint32_t data1, flag1, data2, fIn file included from l/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ ag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from hre/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.hadId:x.x),169 gro: up(gr/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.houp), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_bf8, ncclFuncReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uin | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_bf8, ncclFuncReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ t32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_bf8, ncclFuncReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_bf8, ncclFuncReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_bf8, ncclFuncReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_bf8, ncclFuncReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 77 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_MinMax_bf8, ncclFuncReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ >().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_bf8, ncclFuncReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_bf8, ncclFuncReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_bf8, ncclFuncReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_bf8, ncclFuncReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_bf8, ncclFuncReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:9 warnings generated when compiling for host. 37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_bf8, ncclFuncReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f16, ncclFuncReduce, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f16, ncclFuncReduce, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.In file included from comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f16, ncclFuncReduce, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f16, ncclFuncReduce, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoLL128, 2>' requested here 77 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_MinMax_f16, ncclFuncReduce, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f16, ncclFuncReduce, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f16, ncclFuncReduce, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoLL128, 2>' requested here 77 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_MinMax_f16, ncclFuncReduce, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPL/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f16, ncclFuncReduce, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ E]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f16, ncclFuncReduce, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f16, ncclFuncReduce, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f16, ncclFuncReduce, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f16, ncclFuncReduce, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f16, ncclFuncReduce, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 9 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f16, ncclFuncReduce, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f16, ncclFuncReduce, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.hIn file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f16, ncclFuncReduce, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f16, ncclFuncReduce, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f32, ncclFuncReduce, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f32, ncclFuncReduce, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f32, ncclFuncReduce, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f32, ncclFuncReduce, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f32, ncclFuncReduce, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f32, ncclFuncReduce, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f32, ncclFuncReduce, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | t/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 77 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_MinMax_f32, ncclFuncReduce, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ id(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 77 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_MinMax_f32, ncclFuncReduce, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_)In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f32, ncclFuncReduce, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f32, ncclFuncReduce, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f32, ncclFuncReduce, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f32, ncclFuncReduce, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f32, ncclFuncReduce, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f32, ncclFuncReduce, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f32, ncclFuncReduce, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f32, ncclFuncReduce, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 9/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h warnings generated when compiling for host. :667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f32, ncclFuncReduce, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 9 warnings generated when compiling for gfx1102. 9 warnings generated when compiling for gfx1201. 9 warnings generated when compiling for gfx1101. 9 warnings generated when compiling for gfx1200. 9 warnings generated when compiling for gfx1201. 9 warnings generated when compiling for gfx1101. 9 warnings generated when compiling for gfx1100. 9 warnings generated when compiling for gfx1100. 9 warnings generated when compiling for gfx1200. 9 warnings generated when compiling for gfx1102. 10 warnings generated when compiling for gfx90a. 10 warnings generated when compiling for gfx90a. [ 79%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_minmax_f64.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60300 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_minmax_f64.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_minmax_f64.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_minmax_f64.cpp.o -c /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp 10 warnings generated when compiling for gfx90a. 10 warnings generated when compiling for gfx90a. [ 79%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_minmax_f8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60300 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_minmax_f8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_minmax_f8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_minmax_f8.cpp.o -c /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp 9 warnings generated when compiling for gfx1201. 9 warnings generated when compiling for gfx1100. 9 warnings generated when compiling for gfx1101. 9 warnings generated when compiling for gfx1200. 9 warnings generated when compiling for gfx1102. 10 warnings generated when compiling for gfx90a. 10 warnings generated when compiling for gfx90a. [ 79%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_minmax_u32.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60300 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_minmax_u32.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_minmax_u32.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_minmax_u32.cpp.o -c /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 10 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f64, ncclFuncReduce, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 10 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f64, ncclFuncReduce, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f64, ncclFuncReduce, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 77 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_MinMax_f64, ncclFuncReduce, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f64, ncclFuncReduce, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f64, ncclFuncReduce, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f64, ncclFuncReduce, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f64, ncclFuncReduce, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f64, ncclFuncReduce, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f64, ncclFuncReduce, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 77 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_MinMax_f64, ncclFuncReduce, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f64, ncclFuncReduce, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f64, ncclFuncReduce, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f64, ncclFuncReduce, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f8, ncclFuncReduce, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f64, ncclFuncReduce, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f64, ncclFuncReduce, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 9 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f8, ncclFuncReduce, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f8, ncclFuncReduce, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f64, ncclFuncReduce, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f8, ncclFuncReduce, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f64, ncclFuncReduce, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 9 warnings generated when compiling for host. /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f8, ncclFuncReduce, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f8, ncclFuncReduce, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f8, ncclFuncReduce, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.hIn file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ :506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7:In file included from note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 77 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_MinMax_f8, ncclFuncReduce, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 77 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_MinMax_f8, ncclFuncReduce, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ Op, Algo, Proto, COLL_UNROLL>().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f8, ncclFuncReduce, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f8, ncclFuncReduce, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f8, ncclFuncReduce, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f8, ncclFuncReduce, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f8, ncclFuncReduce, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f8, ncclFuncReduce, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f8, ncclFuncReduce, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f8, ncclFuncReduce, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f8, ncclFuncReduce, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ :2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint6In file included from 4/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1,_t* data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 77 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_MinMax_u32, ncclFuncReduce, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 77 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_MinMax_u32, ncclFuncReduce, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u32, ncclFuncReduce, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u32, ncclFuncReduce, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u32, ncclFuncReduce, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u32, ncclFuncReduce, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u32, ncclFuncReduce, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u32, ncclFuncReduce, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u32, ncclFuncReduce, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u32, ncclFuncReduce, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u32, ncclFuncReduce, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u32, ncclFuncReduce, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u32, ncclFuncReduce, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u32, ncclFuncReduce, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u32, ncclFuncReduce, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algIn file included from o, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u32, ncclFuncReduce, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 9 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u32, ncclFuncReduce, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u32, ncclFuncReduce, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 9 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 9 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 9 warnings generated when compiling for gfx1200. 9 warnings generated when compiling for gfx1201. 9 warnings generated when compiling for gfx1101. [ 79%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_minmax_u64.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60300 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_minmax_u64.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_minmax_u64.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_minmax_u64.cpp.o -c /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp 9 warnings generated when compiling for gfx1201. 9 warnings generated when compiling for gfx1200. 9 warnings generated when compiling for gfx1102. 9 warnings generated when compiling for gfx1100. 9 warnings generated when compiling for gfx1101. 10 warnings generated when compiling for gfx90a. 10 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ [ 80%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_minmax_u8.cpp.o In file included from /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60300 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_minmax_u8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_minmax_u8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_minmax_u8.cpp.o -c /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ )+ll12In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 8Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 9In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ warnings generated when compiling for gfx1102. 9 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 9 warnings generated when compiling for gfx1201. 9 warnings generated when compiling for gfx1101. 9 warnings generated when compiling for gfx1200. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u64, ncclFuncReduce, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 77 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_MinMax_u64, ncclFuncReduce, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u64, ncclFuncReduce, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u64, ncclFuncReduce, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u64, ncclFuncReduce, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 77 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_MinMax_u64, ncclFuncReduce, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ :2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u64, ncclFuncReduce, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u64, ncclFuncReduce, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u64, ncclFuncReduce, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u64, ncclFuncReduce, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u64, ncclFuncReduce, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u64, ncclFuncReduce, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 10 warnings generated when compiling for gfx90a. /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u64, ncclFuncReduce, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, ar/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.hgs-:>667c:o15n:n Inwarning: dinitializer order does not match the declaration order [-Wreorder-ctor]e x); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 667 | 63t | i d ( t irdu)n,R inntghh(raeragdsI)d;x . x| ) ^ , group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h| : tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_203 :66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 668 | 203 | s t e p S i zReu(nsWtoerpkSEilzeem_e n=t=< F0n ,? Tn,c cRleSdhOmpe,m .Aclogmom,. bPurfoftSoi,z eCsO[LNLC_CULN_RPORLOLT>O(_)S.IrMuPnL(Ew]e/)N;C C L| _ ^S TEPS/sizeof(T) : /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpps:t7e:p1S:i znote: ein instantiation of member function 'RunWork, 1, 2, 4>::run' requested here_ ) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u64, ncclFuncReduce, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u64, ncclFuncReduce, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 10 warnings generated when compiling for gfx90a. /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u64, ncclFuncReduce, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u64, ncclFuncReduce, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u64, ncclFuncReduce, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 9 warnings generated when compiling for host. [ 80%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_premulsum_bf16.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60300 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_premulsum_bf16.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_premulsum_bf16.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_premulsum_bf16.cpp.o -c /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ _t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u8, ncclFuncReduce, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u8, ncclFuncReduce, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u8, ncclFuncReduce, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u8, ncclFuncReduce, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u8, ncclFuncReduce, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u8, ncclFuncReduce, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.hIn file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | ste:pSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepS506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 77 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_MinMax_u8, ncclFuncReduce, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ ==ize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u8, ncclFuncReduce, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u8, ncclFuncReduce, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u8, ncclFuncReduce, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 77 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_MinMax_u8, ncclFuncReduce, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u8, ncclFuncReduce, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x)In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ , group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u8, ncclFuncReduce, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 9 warnings generated when compiling for host. /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u8, ncclFuncReduce, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u8, ncclFuncReduce, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u8, ncclFuncReduce, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u8, ncclFuncReduce, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u8, ncclFuncReduce, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.hIn file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ :46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 10 warnings generated when compiling for gfx90a. 10 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 9 warnings generated when compiling for gfx1101. 9 warnings generated when compiling for gfx1200. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_bf16, ncclFuncReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_bf16, ncclFuncReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.hIn file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_bf16, ncclFuncReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 77 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_PreMulSum_bf16, ncclFuncReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_bf16, ncclFuncReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 77 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_PreMulSum_bf16, ncclFuncReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_bf16, ncclFuncReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 9 warnings generated when compiling for gfx1102. 9 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: 9 warnings generated when compiling for gfx1201. note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_bf16, ncclFuncReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_bf16, ncclFuncReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_bf16, ncclFuncReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | t/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.hid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_bf16, ncclFuncReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_bf16, ncclFuncReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(thr/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_bf16, ncclFuncReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ eadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ T, RedOp, Algo, Proto, COLL_UNROLL>().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_bf16, ncclFuncReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_bf16, ncclFuncReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_bf16, ncclFuncReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_bf16, ncclFuncReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_bf16, ncclFuncReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 9 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 10 warnings generated when compiling for gfx90a. 10 warnings generated when compiling for gfx90a. [ 80%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_premulsum_bf8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60300 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_premulsum_bf8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_premulsum_bf8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_premulsum_bf8.cpp.o -c /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp 9 warnings generated when compiling for gfx1100. 9 warnings generated when compiling for gfx1101. 9 warnings generated when compiling for gfx1201. 9 warnings generated when compiling for gfx1102. 9 warnings generated when compiling for gfx1200. [ 80%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_premulsum_f16.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60300 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_premulsum_f16.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_premulsum_f16.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_premulsum_f16.cpp.o -c /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp 9 warnings generated when compiling for gfx1201. 9 warnings generated when compiling for gfx1200. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ 9 warnings generated when compiling for gfx1101. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ 9 warnings generated when compiling for gfx1100. 9 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 10 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ 10 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ [ 81%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_premulsum_f32.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60300 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_premulsum_f32.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_premulsum_f32.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_premulsum_f32.cpp.o -c /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_bf8, ncclFuncReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 77 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_PreMulSum_bf8, ncclFuncReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_bf8, ncclFuncReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_bf8, ncclFuncReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_bf8, ncclFuncReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 77 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_PreMulSum_bf8, ncclFuncReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_bf8, ncclFuncReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_bf8, ncclFuncReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_bf8, ncclFuncReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_bf8, ncclFuncReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_bf8, ncclFuncReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_bf8, ncclFuncReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_bf8, ncclFuncReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_bf8, ncclFuncReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_bf8, ncclFuncReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_bf8, ncclFuncReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 9 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_bf8, ncclFuncReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_bf8, ncclFuncReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f16, ncclFuncReduce, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoLL128, 2>' requested here 77 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_PreMulSum_f16, ncclFuncReduce, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f16, ncclFuncReduce, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f16, ncclFuncReduce, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f16, ncclFuncReduce, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f16, ncclFuncReduce, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f16, ncclFuncReduce, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f16, ncclFuncReduce, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoLL128, 2>' requested here 77 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_PreMulSum_f16, ncclFuncReduce, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f16, ncclFuncReduce, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f16, ncclFuncReduce, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f16, ncclFuncReduce, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f16, ncclFuncReduce, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f16, ncclFuncReduce, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f16, ncclFuncReduce, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f16, ncclFuncReduce, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f16, ncclFuncReduce, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f16, ncclFuncReduce, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 9 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 9 warningsIn file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | generated when compiling for gfx1201. uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 9 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 9 warnings generated when compiling for gfx1100. 9 warnings generated when compiling for gfx1200. 9 warnings generated when compiling for gfx1101. /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 77 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_PreMulSum_f32, ncclFuncReduce, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f32, ncclFuncReduce, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | ti10 warnings generated when compiling for gfx90a. d(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 10 warnings generated when compiling for gfx90a. /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 77 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_PreMulSum_f32, ncclFuncReduce, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f32, ncclFuncReduce, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f32, ncclFuncReduce, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f32, ncclFuncReduce, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f32, ncclFuncReduce, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f32, ncclFuncReduce, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ [ 81%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_premulsum_f64.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60300 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_premulsum_f64.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_premulsum_f64.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_premulsum_f64.cpp.o -c /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f32, ncclFuncReduce, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f32, ncclFuncReduce, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f32, ncclFuncReduce, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f32, ncclFuncReduce, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f32, ncclFuncReduce, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f32, ncclFuncReduce, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f32, ncclFuncReduce, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f32, ncclFuncReduce, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f32, ncclFuncReduce, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f32, ncclFuncReduce, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 9 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 9 warnings generated when compiling for gfx1100. 9 warnings generated when compiling for gfx1102. 9 warnings generated when compiling for gfx1200. 9 warnings generated when compiling for gfx1201. 9 warnings generated when compiling for gfx1101. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uinIn file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ t32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ 10 warnings generated when compiling for gfx90a. 10 warnings generated when compiling for gfx90a. [ 81%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_premulsum_f8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60300 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_premulsum_f8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_premulsum_f8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_premulsum_f8.cpp.o -c /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:2: In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 77 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_PreMulSum_f64, ncclFuncReduce, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f64, ncclFuncReduce, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f64, ncclFuncReduce, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f64, ncclFuncReduce, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f64, ncclFuncReduce, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f64, ncclFuncReduce, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f64, ncclFuncReduce, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f64, ncclFuncReduce, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f64, ncclFuncReduce, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f64, ncclFuncReduce, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f64, ncclFuncReduce, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f64, ncclFuncReduce, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f64, ncclFuncReduce, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 77 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_PreMulSum_f64, ncclFuncReduce, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f64, ncclFuncReduce, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 9 warnings generated when compiling for gfx1100. /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f64, ncclFuncReduce, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS//builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cppsi:z1e: of(In file included from T/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h): 17:: In file included from st/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.he:p11S: iIn file included from ze/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h_:)12 : {In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h :| 126 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h| : group(group14 : In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 46 | static long log2i (33l | o n g n ) p{r i m| s ^~~~~( tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f64, ncclFuncReduce, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 9 warnings generated when compiling for gfx1101. /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f64, ncclFuncReduce, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 9 warnings generated when compiling for host. 9 warnings generated when compiling for gfx1201. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 9 warnings generated when compiling for gfx1200. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 9 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 10 warnings generated when compiling for gfx90a. 10 warnings generated when compiling for gfx90a. [ 81%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_premulsum_u32.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60300 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_premulsum_u32.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_premulsum_u32.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_premulsum_u32.cpp.o -c /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f8, ncclFuncReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 77 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_PreMulSum_f8, ncclFuncReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f8, ncclFuncReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f8, ncclFuncReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f8, ncclFuncReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f8, ncclFuncReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f8, ncclFuncReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(ti/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group d/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(t runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f8, ncclFuncReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ eadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 77 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_PreMulSum_f8, ncclFuncReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f8, ncclFuncReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f8, ncclFuncReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f8, ncclFuncReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f8, ncclFuncReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f8, ncclFuncReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f8, ncclFuncReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f8, ncclFuncReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f8, ncclFuncReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f8, ncclFuncReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 9 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ 9 warnings generated when compiling for gfx1100. 9 warnings generated when compiling for gfx1101. 99 warnings generated when compiling for gfx1102. warnings generated when compiling for gfx1200. 9 warnings generated when compiling for gfx1201. 10 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t*unused variable 'data1' [-Wunused-variable] p140 | tuintr32_ t d=ata r1, felagc1, vdataP2, ftlagr2; (| ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h0:)140:21+: warning: unused variable 'flag1' [-Wunused-variable]l l140 | 1 u2int382_tO datfa1, fflags1, edatta2, ;fla g2 ; | | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h ^~~ :140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 10 warnings generated when compiling for gfx90a. [ 82%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_premulsum_u64.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60300 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_premulsum_u64.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_premulsum_u64.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_premulsum_u64.cpp.o -c /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 77 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_PreMulSum_u32, ncclFuncReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u32, ncclFuncReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 77 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_PreMulSum_u32, ncclFuncReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u32, ncclFuncReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u32, ncclFuncReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u32, ncclFuncReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u32, ncclFuncReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u32, ncclFuncReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u32, ncclFuncReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u32, ncclFuncReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.hIn file included from :667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u32, ncclFuncReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(ti/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u32, ncclFuncReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ d), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u32, ncclFuncReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u32, ncclFuncReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u32, ncclFuncReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u32, ncclFuncReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u32, ncclFuncReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u32, ncclFuncReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 9 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 10 warnings generated when compiling for gfx90a. 10 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] In file included from 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u64, ncclFuncReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u64, ncclFuncReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 77 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_PreMulSum_u64, ncclFuncReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u64, ncclFuncReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 77 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_PreMulSum_u64, ncclFuncReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u64, ncclFuncReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u64, ncclFuncReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u64, ncclFuncReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u64, ncclFuncReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u64, ncclFuncReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u64, ncclFuncReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u64, ncclFuncReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u64, ncclFuncReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u64, ncclFuncReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u64, ncclFuncReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u64, ncclFuncReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 9 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u64, ncclFuncReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u64, ncclFuncReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 9 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 9 warnings generated when compiling for gfx1201. 9 warnings generated when compiling for gfx1100. 9 warnings generated when compiling for gfx1101. 9 warnings generated when compiling for gfx1200. 10 warnings generated when compiling for gfx90a. 10 warnings generated when compiling for gfx90a. [ 82%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_premulsum_u8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60300 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_premulsum_u8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_premulsum_u8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_premulsum_u8.cpp.o -c /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp 9 warnings generated when compiling for gfx1100. 9 warnings generated when compiling for gfx1101. 9 warnings generated when compiling for gfx1102. 9 warnings generated when compiling for gfx1200. 9 warnings generated when compiling for gfx1201. [ 82%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_prod_bf16.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60300 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_prod_bf16.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_prod_bf16.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_prod_bf16.cpp.o -c /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp 9 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ 9 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ 9 warnings generated when compiling for gfx1201. 9 warnings generated when compiling for gfx1200. 9 warnings generated when compiling for gfx1101. 10 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 10 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ [ 82%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_prod_bf8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60300 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_prod_bf8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_prod_bf8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_prod_bf8.cpp.o -c /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u8, ncclFuncReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u8, ncclFuncReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u8, ncclFuncReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u8, ncclFuncReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u8, ncclFuncReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u8, ncclFuncReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u8, ncclFuncReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u8, ncclFuncReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(ttid(tid)hread, nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ Idx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 77 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_PreMulSum_u8, ncclFuncReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u8, ncclFuncReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.hIn file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ :506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 77 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_PreMulSum_u8, ncclFuncReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u8, ncclFuncReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u8, ncclFuncReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid)In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:2: In file included from , nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u8, ncclFuncReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElementIn file included from ().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u8, ncclFuncReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: 667 | tid(tid), nthreads(nthreads), tidInBlocunused variable 'ptr' [-Wunused-variable] k(threadIdx. 270 | x), group(group), | ^~~~~~~~~~~ uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ :667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u8, ncclFuncReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from 0/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ )+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBloc/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ k(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u8, ncclFuncReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u8, ncclFuncReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr9 warnings generated when compiling for host. = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_bf16, ncclFuncReduce, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_bf16, ncclFuncReduce, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_bf16, ncclFuncReduce, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 77 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_Prod_bf16, ncclFuncReduce, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_bf16, ncclFuncReduce, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_bf16, ncclFuncReduce, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_bf16, ncclFuncReduce, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_bf16, ncclFuncReduce, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>()In file included from ./builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cppr:un1(: &In file included from ncc/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.hl:S17h: mIn file included from e/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.hm:.11w: oIn file included from r/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.hk:)12;: In file included from \/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h : 126| : ^In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h::46667::1315:: warning: note: unused function 'log2i' [-Wunused-function]field 'nthreads' will be initialized after field 'tidInBlock' 667 | ti d46( | tsidt)a,t inct hlreoandgs (lnotgh2rie(aldosn)g, nt)i d{I n B| l ^~~~~o ck(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_bf16, ncclFuncReduce, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_bf16, ncclFuncReduce, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 77 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_Prod_bf16, ncclFuncReduce, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_bf16, ncclFuncReduce, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_bf16, ncclFuncReduce, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_bf16, ncclFuncReduce, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_bf16, ncclFuncReduce, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_bf16, ncclFuncReduce, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_bf16, ncclFuncReduce, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 9/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h warnings generated when compiling for host. :667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_bf16, ncclFuncReduce, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.hIn file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ :11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 10 warnings generated when compiling for gfx90a. 10 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_bf8, ncclFuncReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_bf8, ncclFuncReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 77 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_Prod_bf8, ncclFuncReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 77 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_Prod_bf8, ncclFuncReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_bf8, ncclFuncReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(grou/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.hp):,667 : 15| : ^~~~~~~~~~~~~~~~~ warning: initializer order does not match the declaration order [-Wreorder-ctor] /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | 667t | i d ( t itdi)d,( tnitdh)r,e andtsh(rnetahdrse(andtsh)r,e atdisd)I,n BtliodcIkn(Btlhorceka(dtIhdrxe.axd)I,d xg.rxo)u,p (ggrroouupp()g,r o u| p ^~~~~~~~~~~) , | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_bf8, ncclFuncReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_bf8, ncclFuncReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:| 667:15: tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthread668s(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_bf8, ncclFuncReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_bf8, ncclFuncReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRingIn file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_bf8, ncclFuncReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ (args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_bf8, ncclFuncReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_bf8, ncclFuncReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_bf8, ncclFuncReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_bf8, ncclFuncReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_bf8, ncclFuncReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_bf8, ncclFuncReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_bf8, ncclFu/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args-n>cReducre, FunecProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ cvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_bf8, ncclFuncReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 9 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 9 warnings generated when compiling for gfx1201. 9 warnings generated when compiling for gfx1101. 9 warnings generated when compiling for gfx1200. 9 warnings generated when compiling for gfx1100. 9 warnings generated when compiling for gfx1102. 9 warnings generated when compiling for gfx1102. 9 warnings generated when compiling for gfx1100. 9 warnings generated when compiling for gfx1101. 9 warnings generated when compiling for gfx1200. 9 warnings generated when compiling for gfx1201. 10 warnings generated when compiling for gfx90a. 10 warnings generated when compiling for gfx90a. 9 warnings generated when compiling for gfx1201. [ 83%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_prod_f16.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60300 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_prod_f16.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_prod_f16.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_prod_f16.cpp.o -c /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp 9 warnings generated when compiling for gfx1100. 9 warnings generated when compiling for gfx1200. 9 warnings generated when compiling for gfx1101. 9 warnings generated when compiling for gfx1102. [ 83%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_prod_f32.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60300 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_prod_f32.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_prod_f32.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_prod_f32.cpp.o -c /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp 10 warnings generated when compiling for gfx90a. 10 warnings generated when compiling for gfx90a. [ 83%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_prod_f64.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60300 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_prod_f64.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_prod_f64.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_prod_f64.cpp.o -c /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable]In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoLL128, 2>' requested here 77 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_Prod_f16, ncclFuncReduce, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoLL128, 2>' requested here 77 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_Prod_f16, ncclFuncReduce, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f16, ncclFuncReduce, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f16, ncclFuncReduce, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f16, ncclFuncReduce, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f16, ncclFuncReduce, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f16, ncclFuncReduce, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f16, ncclFuncReduce, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f16, ncclFuncReduce, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f16, ncclFuncReduce, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f16, ncclFuncReduce, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f16, ncclFuncReduce, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f16, ncclFuncReduce, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f32, ncclFuncReduce, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f16, ncclFuncReduce, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f16, ncclFuncReduce, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f16, ncclFuncReduce, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f16, ncclFuncReduce, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f32, ncclFuncReduce, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f32, ncclFuncReduce, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f16, ncclFuncReduce, FuncProd, half, NCCL_ALIn file included from GO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f32, ncclFuncReduce, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 77 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_Prod_f32, ncclFuncReduce, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ 9 warnings generated when compiling for host. /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.hIn file included from :667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f32, ncclFuncReduce, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f32, ncclFuncReduce, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f32, ncclFuncReduce, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f32, ncclFuncReduce, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f64, ncclFuncReduce, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 77 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_Prod_f32, ncclFuncReduce, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f32, ncclFuncReduce, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f32, ncclFuncReduce, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10initializer order does not match the declaration order [-Wreorder-ctor] : In file included from 667 | tid(tid), nthreads(nthreads), tidInBloc/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.hk:167(threadIdx.x), gro: up(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | p667 | tid(tidrims(t), nthid, nthreads, &reads(nthrring->preads),e tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f64, ncclFuncReduce, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ v, &ring->next, args->sendbuff, args->recvbuIn file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->nextff, args->redOpArg, 0, args->connIndex, args->connIndex), args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f32, ncclFuncReduce, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ; | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f64, ncclFuncReduce, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ o, COLL_UNROLL>().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f32, ncclFuncReduce, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 77 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_Prod_f64, ncclFuncReduce, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f32, ncclFuncReduce, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recIn file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ vbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f64, ncclFuncReduce, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShme6m4.,c onmcmc.lbFuufnfcSRiezdeusc[eN,C CFLu_nPcRPOrToOd_,S IdMoPuLbEl]e/,N CNCCLC_LS_TAELPGSO/_sRiIzNeGo,f (NTC)C L:_ PsRtOeTpOS_iSzIeM_P)L E{) | | ^ ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 406 | RunWork< c33o | l l , t y ,p rriemdso(pt ,n tahlrgeoa,d sp,r o&troi,n g2->>(p)r.ervu,n (&&rnicncgl-S>hnmeexmt.,w oarrkg)s;- >\s e n| d ^b uff, args->recvbuff, ar/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.hg:s667-:>15r:e dnote: Opfield 'nthreads' will be initialized after field 'tidInBlock'A rg, 0, args->co n667n | I n d e xt,i da(rtgisd-)>,c onntnhIrnedaedxs)(;n t h| r ^e ads), tidI/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.hn:B63l:o5c:k (note: tin instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested hereh readIdx. x63) | , g r oruupn(Rgirnogu(args )667; | | ^ tid(tid), nthreads/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h(:n203t:h66r:e anote: din instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested heres ), tidInBlo c203k | ( t h r e a d I dRxu.nxW)o,r kgErloeumpe(ngtr().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f32, ncclFuncReduce, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f64, ncclFuncReduce, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f64, ncclFuncReduce, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 9 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f64, ncclFuncReduce, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PRO/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group)TO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 77 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_Prod_f64, ncclFuncReduce, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ , | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f32, ncclFuncReduce, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f64, ncclFuncReduce, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f64, ncclFuncReduce, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f64, ncclFuncReduce, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f64, ncclFuncReduce, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f32, ncclFuncReduce, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, arIn file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ gs->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f64, ncclFuncReduce, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h9:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | warnings generated when compiling for host. prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f64, ncclFuncReduce, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendIn file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ buff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f64, ncclFuncReduce, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f64, ncclFuncReduce, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 10 warnings generated when compiling for gfx90a. 10 warnings generated when compiling for gfx90a. 9 warnings generated when compiling for gfx1102. 9 warnings generated when compiling for gfx1200. 9 warnings generated when compiling for gfx1101. 9 warnings generated when compiling for gfx1100. 9 warnings generated when compiling for gfx1201. [ 83%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_prod_f8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60300 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_prod_f8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_prod_f8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_prod_f8.cpp.o -c /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp 99 warnings generated when compiling for gfx1201. warnings generated when compiling for gfx1201. 9 warnings generated when compiling for gfx1200. 9 warnings generated when compiling for gfx1100. 9 warnings generated when compiling for gfx1101. 99 warnings generated when compiling for gfx1201. warnings generated when compiling for gfx1100. 99 warnings generated when compiling for gfx1101. warnings generated when compiling for gfx1100. 9 warnings generated when compiling for gfx1101. 9 warnings generated when compiling for gfx1200. 9 warnings generated when compiling for gfx1200. 9 warnings generated when compiling for gfx1102. 9 warnings generated when compiling for gfx1102. 9 warnings generated when compiling for gfx1102. 10 warnings generated when compiling for gfx90a. 10 warnings generated when compiling for gfx90a. 10 warnings generated when compiling for gfx90a. 10 warnings generated when compiling for gfx90a. 10 warnings generated when compiling for gfx90a. [ 84%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_prod_u32.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60300 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_prod_u32.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_prod_u32.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_prod_u32.cpp.o -c /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp 10 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ [ 84%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_prod_u64.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60300 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_prod_u64.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_prod_u64.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_prod_u64.cpp.o -c /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ [ 84%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_prod_u8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60300 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_prod_u8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_prod_u8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_prod_u8.cpp.o -c /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | : uint32_t d140ata1, flag1:, data2, 21flag2;: | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h :140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f8, ncclFuncReduce, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f8, ncclFuncReduce, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f8, ncclFuncReduce, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f8, ncclFuncReduce, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f8, ncclFuncReduce, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f8, ncclFuncReduce, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f8, ncclFuncReduce, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 77 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_Prod_f8, ncclFuncReduce, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_LL128) /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15 :| ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f8, ncclFuncReduce, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 77 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_Prod_f8, ncclFuncReduce, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f8, ncclFuncReduce, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f8, ncclFuncReduce, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f8, ncclFuncReduce, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ =In file included from = 0 ? /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:1n: In file included from c/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from c/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from l/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.hS:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from h/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37m: In file included from e/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:m14: ./builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:c13o: warning: munused function 'log2i' [-Wunused-function] m 46 | st.atic long log2i(long n) { | ^~~~~ buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f8, ncclFuncReduce, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f8, ncclFuncReduce, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f8, ncclFuncReduce, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f8, ncclFuncReduce, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f8, ncclFuncReduce, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 9 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t datIn file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ a1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u32, ncclFuncReduce, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u8, ncclFuncReduce, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u32, ncclFuncReduce, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u32, ncclFuncReduce, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 77 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_Prod_u32, ncclFuncReduce, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 77 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_Prod_u64, ncclFuncReduce, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u32, ncclFuncReduce, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.hIn file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u64, ncclFuncReduce, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u32, ncclFuncReduce, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u64, ncclFuncReduce, FuncPOLL_UNROLL>().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u32, ncclFuncReduce, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ rod, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u8, ncclFuncReduce, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u32, ncclFuncReduce, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u64, ncclFuncReduce, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u32, ncclFuncReduce, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 77 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_Prod_u32, ncclFuncReduce, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u8, ncclFuncReduce, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u32, ncclFuncReduce, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS//builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:2s: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10i: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:z15: warning: initializer order does not match the declaration order [-Wreorder-ctor] e667 | o tifd(tid)(, nthrTeads(n)thread s), ti:dInBlo ck(thrseadIdx.tx), greoup(gropup), S| ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ i 668 | z stepSieze(stepS_ize_ ==) 0 ? n cclShme{m.comm. buffSi zes[NCCL| _PROTO_ ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~SIMPLE] /NCCL _STEPS/sizeof(T) : stepS| ize_) group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u32, ncclFuncReduce, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u8, ncclFuncReduce, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u8, ncclFuncReduce, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u32, ncclFuncReduce, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u64, ncclFuncReduce, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u64, ncclFuncReduce, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ stepSize(stepSize_ == 0 ? In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u64, ncclFuncReduce, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u64, ncclFuncReduce, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 77 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_Prod_u8, ncclFuncReduce, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u8, ncclFuncReduce, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | steIn file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u32, ncclFuncReduce, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tpSize(stepSize_ == 0 ? ncclShmeIn file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ m.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, argsi-d), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' >sendbuff, args->recvbuff, args->redOpArg, 0 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ , args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u32, ncclFuncReduce, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u8, ncclFuncReduce, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u64, ncclFuncReduce, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 9 warnings generated when compiling for host. /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 77 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_Prod_u64, ncclFuncReduce, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ :667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u8, ncclFuncReduce, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u8, ncclFuncReduce, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u8, ncclFuncReduce, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nth/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u64, ncclFuncReduce, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tidreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ (tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 77 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_Prod_u8, ncclFuncReduce, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthread/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->s(nthreads), tidInBprev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u64, ncclFuncReduce, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.hIn file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u64, ncclFuncReduce, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u8, ncclFuncReduce, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ Idx.x), group(group), | ^~~~~~~~~~~ 9 warnings generated when compiling for host. /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u8, ncclFuncReduce, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u32, ncclFuncReduce, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x),/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u64, ncclFuncReduce, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u32, ncclFuncReduce, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u64, ncclFuncReduce, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u8, ncclFuncReduce, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u64, ncclFuncReduce, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u32, ncclFuncReduce, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u8, ncclFuncReduce, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u64, ncclFuncReduce, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 9 warnings generated when compiling for host. /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u8, ncclFuncReduce, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u8, ncclFuncReduce, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u64, ncclFuncReduce, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 9 warnings generated when compiling for gfx1102. 9 warnings generated when compiling for gfx1101. 9 warnings generated when compiling for gfx1201. 9 warnings generated when compiling for gfx1200. 9 warnings generated when compiling for gfx1200. 9 warnings generated when compiling for gfx1102. 9 warnings generated when compiling for gfx1101. 9 warnings generated when compiling for gfx1100. 9 warnings generated when compiling for gfx1201. 9 warnings generated when compiling for gfx1100. 10 warnings generated when compiling for gfx90a. 10 warnings generated when compiling for gfx90a. 9 warnings generated when compiling for gfx1201. 9 warnings generated when compiling for gfx1102. 9 warnings generated when compiling for gfx1100. 10 warnings generated when compiling for gfx90a. 9 warnings generated when compiling for gfx1200. 10 warnings generated when compiling for gfx90a. 9 warnings generated when compiling for gfx1101. [ 84%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_minmax_bf16.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60300 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_minmax_bf16.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_minmax_bf16.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_minmax_bf16.cpp.o -c /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp [ 85%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_minmax_bf8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60300 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_minmax_bf8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_minmax_bf8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_minmax_bf8.cpp.o -c /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp 10 warnings generated when compiling for gfx90a. 10 warnings generated when compiling for gfx90a. [ 85%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_minmax_f16.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60300 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_minmax_f16.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_minmax_f16.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_minmax_f16.cpp.o -c /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp 10 warnings generated when compiling for gfx90a. 10 warnings generated when compiling for gfx90a. In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ 9 warnings generated when compiling for gfx1101. 9 warnings generated when compiling for gfx1100. 9 warnings generated when compiling for gfx1201. 9 warnings generated when compiling for gfx1102. 9 warnings generated when compiling for gfx1200. [ 85%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_minmax_f32.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60300 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_minmax_f32.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_minmax_f32.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_minmax_f32.cpp.o -c /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28f: warning: unused variable 'data2' [-Wunused-variable] l a140g | 1 , d autian2t,3 2f_lta gd2a;t a1 ,| ^~~~~f lag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h35:: warning: 140unused variable 'flag2' [-Wunused-variable]: 21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t d140a | t a 1 , ufilnatg312,_ td adtaat2a,1 ,f lfalga2g;1 , | d ^~~~~a ta2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_bf16, ncclFuncReduceScatter, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_bf16, ncclFuncReduceScatter, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_bf16, ncclFuncReduceScatter, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_MinMax_bf16, ncclFuncReduceScatter, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_bf16, ncclFuncReduceScatter, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_bf16, ncclFuncReduceScatter, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_bf8, ncclFuncReduceScatter, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_bf16, ncclFuncReduceScatter, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_MinMax_bf16, ncclFuncReduceScatter, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_bf16, ncclFuncReduceScatter, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_bf16, ncclFuncReduceScatter, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_bf8, ncclFuncReduceScatter, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_bf8, ncclFuncReduceScatter, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_bf16, ncclFuncReduceScatter, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_MinMax_bf8, ncclFuncReduceScatter, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_bf8, ncclFuncReduceScatter, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthr/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_MinMax_bf8, ncclFuncReduceScatter, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ eads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_bf16, ncclFuncReduceScatter, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_bf16, ncclFuncReduceScatter, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_bf8, ncclFuncReduceScatter, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_bf8, ncclFuncReduceScatter, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.hIn file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_bf8, ncclFuncReduceScatter, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_bf16, ncclFuncReduceScatter, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_bf16, ncclFuncReduceScatter, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_bf8, ncclFuncReduceScatter, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_bf8, ncclFuncReduceScatter, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_bf8, ncclFuncReduceScatter, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadId/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_bf16, ncclFuncReduceScatter, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ x.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 9/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_bf8, ncclFuncReduceScatter, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == warning0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_bf16, ncclFuncReduceScatter, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ s generated when compiling for host. /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_bf8, ncclFuncReduceScatter, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:1: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_bf16, ncclFuncReduceScatter, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_bf8, ncclFuncReduceScatter, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_bf8, ncclFuncReduceScatter, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_bf8, ncclFuncReduceScatter, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_bf8, ncclFuncReduceScatter, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 9 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.hIn file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ :140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f16, ncclFuncReduceScatter, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoLL128, 2>' requested here 79 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_MinMax_f16, ncclFuncReduceScatter, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f16, ncclFuncReduceScatter, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f16, ncclFuncReduceScatter, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmIn file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here e m203. | c o m m . b u f fRSuinzWeosr[kNEClCeLm_ePnRtOi(z)e._r)u n{( w e| ) ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~; | | group(group ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_nc/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.hcl:D34e:v7F:u nnote: c(in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested hereR educeScatter_RING_SIMPL E34_ | M in M a x _ fp1r6i,m sn(ctcildF,u nnctRherdeuacdesS,c a&trtienrg,- >FpurnecvM,i n&Mraixn,g -h>anlefx,t ,N CaCrLg_sA-L>GsOe_nRdIbNuGf,f ,N CaCrLg_sP-R>OrTeOc_vSbIuMfPfL,E )a r g| s^- >redOpArg,/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h :0406,: 52a:r gnote: sexpanded from macro 'DEFINE_ncclDevFunc'- >connIndex, 406a | r g s - >RcuonnWnoIrnkd, alg/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.ho:,65 :p5r:o tnote: oin instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<2, 2, 2>, 2>' requested here, 2>().run (65& | n c c l SrhumneRmi.nwgonote: (field 'nthreads' will be initialized after field 'tidInBlock'a rgs); | ^ 667 | tid(tid)/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h,: 203n:t66h:r enote: ain instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested hered s(nthreads), 203t | i d I n B l o c kR(utnhWroerakdEIldexm.exn)t,< Fgnr,o uTp,( gRreoduOpp),, A l| g ^~~~~~~~~~~~~~~~~o , Prot/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.ho:,667 :C60O:L Lnote: _field 'group' will be initialized after field 'stepSize'U NROLL>().ru n667( | w e ) ; t i| d ^( tid), nthread/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpps:(7n:t1h:r enote: ain instantiation of member function 'RunWork, 1, 2, 2>::run' requested hered s), tidInBlock(thr e7a | dDIEdFxI.NxE)_,n gcrcoluDpe(vgFruonucp()R,e d | ^~~~~~~~~~~ uceScatter_RING_SIMPLE_MinMax_f16, ncclFuncReduceScatter, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid,In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f16, ncclFuncReduceScatter, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f16, ncclFuncReduceScatter, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f16, ncclFuncReduceScatter, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.hIn file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, ar/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.hg:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f16, ncclFuncReduceScatter, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>()s->redOpArg, 0, args->connIndex, a:667:rgs->connIndex)15; | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f16, ncclFuncReduceScatter, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: .run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ : warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f16, ncclFuncReduceScatter, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoLL128, 2>' requested here 79 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_MinMax_f16, ncclFuncReduceScatter, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f16, ncclFuncReduceScatter, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f16, ncclFuncReduceScatter, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52:In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f16, ncclFuncReduceScatter, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f16, ncclFuncReduceScatter, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 9 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f16, ncclFuncReduceScatter, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uin/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cppt32_t: da2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:ta1691, flag: 1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_MinMax_f32, ncclFuncReduceScatter, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f32, ncclFuncReduceScatter, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f32, ncclFuncReduceScatter, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f32, ncclFuncReduceScatter, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_MinMax_f32, ncclFuncReduceScatter, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f32, ncclFuncReduceScatter, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f32, ncclFuncReduceScatter, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f32, ncclFuncReduceScatter, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f32, ncclFuncReduceScatter, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f32, ncclFuncReduceScatter, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f32, ncclFuncReduceScatter, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f32, ncclFuncReduceScatter, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f32, ncclFuncReduceScatter, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f32, ncclFuncReduceScatter, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f32, ncclFuncReduceScatter, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f32, ncclFuncReduceScatter, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f32, ncclFuncReduceScatter, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h667::1715: :In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.hnote: :field 'nthreads' will be initialized after field 'tidInBlock'11 : In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h667: | 14 : In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h :t37i: dIn file included from (/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.ht:i14d: ),/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h n:t46h:r13e:a dwarning: s(unused function 'log2i' [-Wunused-function]nt hreads), tidInBlock(thread I46dx | .sxt)a,t igcr oluopn(gg rloougp2)i,( l o| ^~~~~~~~~~~~~~~~~n g n) {/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h : 667| : ^~~~~60 : note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f32, ncclFuncReduceScatter, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 9 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 9 warnings generated when compiling for gfx1201. 9 warnings generated when compiling for gfx1200. 9 warnings generated when compiling for gfx1100. 9 warnings generated when compiling for gfx1101. 9 warnings generated when compiling for gfx1102. 9 warnings generated when compiling for gfx1200. 9 warnings generated when compiling for gfx1201. 9 warnings generated when compiling for gfx1102. 9 warnings generated when compiling for gfx1100. 9 warnings generated when compiling for gfx1101. 10 warnings generated when compiling for gfx90a. 10 warnings generated when compiling for gfx90a. 10 warnings generated when compiling for gfx90a. 10 warnings generated when compiling for gfx90a. [ 85%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_minmax_f64.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60300 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_minmax_f64.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_minmax_f64.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_minmax_f64.cpp.o -c /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp [ 86%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_minmax_f8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60300 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_minmax_f8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_minmax_f8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_minmax_f8.cpp.o -c /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp 9 warnings generated when compiling for gfx1100. 9 warnings generated when compiling for gfx1101. 9 warnings generated when compiling for gfx1201. 9 warnings generated when compiling for gfx1102. 9 warnings generated when compiling for gfx1200. 10 warnings generated when compiling for gfx90a. 10 warnings generated when compiling for gfx90a. [ 86%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_minmax_u32.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60300 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_minmax_u32.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_minmax_u32.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_minmax_u32.cpp.o -c /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ 10 warnings generated when compiling for gfx90a. 10 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t dat/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ a1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f64, ncclFuncReduceScatter, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f64, ncclFuncReduceScatter, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_MinMax_f64, ncclFuncReduceScatter, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f64, ncclFuncReduceScatter, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f64, ncclFuncReduceScatter, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f8, ncclFuncReduceScatter, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f64, ncclFuncReduceScatter, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f64, ncclFuncReduceScatter, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_MinMax_f8, ncclFuncReduceScatter, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_MinMax_f64, ncclFuncReduceScatter, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group :667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f64, ncclFuncReduceScatter, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f64, ncclFuncReduceScatter, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f8, ncclFuncReduceScatter, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f64, ncclFuncReduceScatter, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f8, ncclFuncReduceScatter, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f8, ncclFuncReduceScatter, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f64, ncclFuncReduceScatter, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f8, ncclFuncReduceScatter, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f8, ncclFuncReduceScatter, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 9 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f64, ncclFuncReduceScatter, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f8, ncclFuncReduceScatter, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f64, ncclFuncReduceScatter, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f64, ncclFuncReduceScatter, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.hIn file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.hIn file included from :667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f64, ncclFuncReduceScatter, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t :667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f8, ncclFuncReduceScatter, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f8, ncclFuncReduceScatter, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f8, ncclFuncReduceScatter, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f8, ncclFuncReduceScatter, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f64, ncclFuncReduceScatter, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79 | runRing, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f8, ncclFuncReduceScatter, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_MinMax_f8, ncclFuncReduceScatter, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f8, ncclFuncReduceScatter, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f64, ncclFuncReduceScatter, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f8, ncclFuncReduceScatter, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 9 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.hIn file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ :667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f8, ncclFuncReduceScatter, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f8, ncclFuncReduceScatter, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 9 warnings generated when compiling for gfx1101. /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_MinMax_u32, ncclFuncReduceScatter, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_MinMax_u32, ncclFuncReduceScatter, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u32, ncclFuncReduceScatter, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u32, ncclFuncReduceScatter, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u32, ncclFuncReduceScatter, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u32, ncclFuncReduceScatter, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u32, ncclFuncReduceScatter, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 9 warnings generated when compiling for gfx1200. 9 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u32, ncclFuncReduceScatter, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCC9 warnings generated when compiling for gfx1201. L_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u32, ncclFuncReduceScatter, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 9 warnings generated when compiling for gfx1102. /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u32, ncclFuncReduceScatter, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatIn file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u32, ncclFuncReduceScatter, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' ter_RING_SIMPLE_MinMax_u32, ncclFuncReduceScatter, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u32, ncclFuncReduceScatter, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u32, ncclFuncReduceScatter, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u32, ncclFuncReduceScatter, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h[ 86%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_minmax_u64.cpp.o :667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSiz/usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60300 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_minmax_u64.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_minmax_u64.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_minmax_u64.cpp.o -c /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp e_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u32, ncclFuncReduceScatter, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u32, ncclFuncReduceScatter, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 9 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u32, ncclFuncReduceScatter, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 9 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ 9 warnings generated when compiling for gfx1201. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ 9 warnings generated when compiling for gfx1101. 9 warnings generated when compiling for gfx1102. 9 warnings generated when compiling for gfx1200. 10 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.hIn file included from :140:14: warning: unused variable 'data1' [-Wunused-variable] /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 10 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ [ 86%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_minmax_u8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60300 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_minmax_u8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_minmax_u8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_minmax_u8.cpp.o -c /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u64, ncclFuncReduceScatter, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u64, ncclFuncReduceScatter, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.hIn file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.b:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElementTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u64, ncclFuncReduceScatter, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: ().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_MinMax_u64, ncclFuncReduceScatter, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u64, ncclFuncReduceScatter, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u64, ncclFuncReduceScatter, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_MinMax_u64, ncclFuncReduceScatter, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u64, ncclFuncReduceScatter, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u64, ncclFuncReduceScatter, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u64, ncclFuncReduceScatter, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u64, ncclFuncReduceScatter, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u64, ncclFuncReduceScatter, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>()PLE]/NCCL_STEPS/si.run(&ncclShmem.work); \ | zeof(T ^ ) : s/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads)tepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u64, ncclFuncReduceScatter, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ , tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u64, ncclFuncReduceScatter, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u64, ncclFuncReduceScatter, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 9 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u64, ncclFuncReduceScatter, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 9 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u64, ncclFuncReduceScatter, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 9 warnings generated when compiling for gfx1101. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u64, ncclFuncReduceScatter, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 9 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 9 warnings generated when compiling for gfx1200. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 9 warnings generated when compiling for gfx1201. 10 warnings generated when compiling for gfx90a. 10 warnings generated when compiling for gfx90a. [ 87%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60300 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp.o -c /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u8, ncclFuncReduceScatter, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u8, ncclFuncReduceScatter, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u8, ncclFuncReduceScatter, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u8, ncclFuncReduceScatter, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_MinMax_u8, ncclFuncReduceScatter, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u8, ncclFuncReduceScatter, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u8, ncclFuncReduceScatter, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 9 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u8, ncclFuncReduceScatter, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_MinMax_u8, ncclFuncReduceScatter, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u8, ncclFuncReduceScatter, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u8, ncclFuncReduceScatter, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u8, ncclFuncReduceScatter, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u8, ncclFuncReduceScatter, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u8, ncclFuncReduceScatter, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u8, ncclFuncReduceScatter, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u8, ncclFuncReduceScatter, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u8, ncclFuncReduceScatter, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u8, ncclFuncReduceScatter, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 10 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 10 warnings generated when compiling for gfx90a. 9 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 9 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 9 warnings generated when compiling for gfx1201. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ :19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 9 warnings generated when compiling for gfx1101. 9 warnings generated when compiling for gfx1200. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_bf16, ncclFuncReduceScatter, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_bf16, ncclFuncReduceScatter, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_bf16, ncclFuncReduceScatter, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_bf16, ncclFuncReduceScatter, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_PreMulSum_bf16, ncclFuncReduceScatter, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ 10 warnings generated when compiling for gfx90a. 10 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_bf16, ncclFuncReduceScatter, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_bf16, ncclFuncReduceScatter, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ In file included from 668 | /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp: 1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h :17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h :11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.hs:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.ht:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.he:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.hp:37: In file included from S/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.hi:46:13:z warning: unused function 'log2i' [-Wunused-function] e(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_bf16, ncclFuncReduceScatter, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 46 | static long log2i(long n) { | ^~~~~ 9 warnings generated when compiling for host. [ 87%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60300 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp.o -c /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_bf16, ncclFuncReduceScatter, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_bf16, ncclFuncReduceScatter, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_bf16, ncclFuncReduceScatter, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_bf16, ncclFuncReduceScatter, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_bf16, ncclFuncReduceScatter, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_bf16, ncclFuncReduceScatter, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_PreMulSum_bf16, ncclFuncReduceScatter, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_bf16, ncclFuncReduceScatter, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_bf16, ncclFuncReduceScatter, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_bf16, ncclFuncReduceScatter, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 9 warnings generated when compiling for gfx1100. 9 warnings generated when compiling for gfx1200. 9 warnings generated when compiling for gfx1101. 9 warnings generated when compiling for gfx1201. 9 warnings generated when compiling for gfx1102. [ 87%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_premulsum_f16.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60300 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_premulsum_f16.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_premulsum_f16.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_premulsum_f16.cpp.o -c /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, dataIn file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 9 warnings generated when compiling for gfx1102. /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_PreMulSum_bf8, ncclFuncReduceScatter, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_bf8, ncclFuncReduceScatter, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, 9&ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_bf8, ncclFuncReduceScatter, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthread warnings generated when compiling for gfx1201. s), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 9 warnings generated when compiling for gfx1200. 9 warnings generated when compiling for gfx1101. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_bf8, ncclFuncReduceScatter, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_bf8, ncclFuncReduceScatter, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_bf8, ncclFuncReduceScatter, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_bf8, ncclFuncReduceScatter, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_bf8, ncclFuncReduceScatter, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_bf8, ncclFuncReduceScatter, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_bf8, ncclFuncReduceScatter, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 9 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_PreMulSum_bf8, ncclFuncReduceScatter, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_bf8, ncclFuncReduceScatter, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 9 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_bf8, ncclFuncReduceScatter, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_bf8, ncclFuncReduceScatter, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_bf8, ncclFuncReduceScatter, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_bf8, ncclFuncReduceScatter, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_bf8, ncclFuncReduceScatter, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_bf8, ncclFuncReduceScatter, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 10 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ 10 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ [ 88%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_premulsum_f32.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60300 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_premulsum_f32.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_premulsum_f32.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_premulsum_f32.cpp.o -c /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoLL128, 2>' requested here 79 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_PreMulSum_f16, ncclFuncReduceScatter, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f16, ncclFuncReduceScatter, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f16, ncclFuncReduceScatter, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f16, ncclFuncReduceScatter, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f16, ncclFuncReduceScatter, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f16, ncclFuncReduceScatter, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f16, ncclFuncReduceScatter, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f16, ncclFuncReduceScatter, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f16, ncclFuncReduceScatter, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f16, ncclFuncReduceScatter, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f16, ncclFuncReduceScatter, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f16, ncclFuncReduceScatter, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f16, ncclFuncReduceScatter, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoLL128, 2>' requested here 79 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_PreMulSum_f16, ncclFuncReduceScatter, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f16, ncclFuncReduceScatter, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f16, ncclFuncReduceScatter, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 9 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f16, ncclFuncReduceScatter, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f16, ncclFuncReduceScatter, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ 9 warnings generated when compiling for gfx1200. 9 warnings generated when compiling for gfx1201. 9 warnings generated when compiling for gfx1101. 9 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t datIn file included from a1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | 9 warning uint3s generated when compiling for gfx1102. 2_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 10 warnings generated when compiling for gfx90a. /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_PreMulSum_f32, ncclFuncReduceScatter, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ 10 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex)In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthr; | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f32, ncclFuncReduceScatter, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ eads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f32, ncclFuncReduceScatter, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f32, ncclFuncReduceScatter, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f32, ncclFuncReduceScatter, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_PreMulSum_f32, ncclFuncReduceScatter, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f32, ncclFuncReduceScatter, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f32, ncclFuncReduceScatter, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f32, ncclFuncReduceScatter, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f32, ncclFuncReduceScatter, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ [ 88%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_premulsum_f64.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60300 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_premulsum_f64.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_premulsum_f64.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_premulsum_f64.cpp.o -c /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f32, ncclFuncReduceScatter, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(n/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.cothreads), tmim.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f32, ncclFuncReduceScatter, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nt, | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)hreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f32, ncclFuncReduceScatter, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f32, ncclFuncReduceScatter, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f32, ncclFuncReduceScatter, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f32, ncclFuncReduceScatter, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f32, ncclFuncReduceScatter, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f32, ncclFuncReduceScatter, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 9 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 9 warnings generated when compiling for gfx1102. 9 warnings generated when compiling for gfx1201. 9 warnings generated when compiling for gfx1100. 9 warnings generated when compiling for gfx1200. 9 warnings generated when compiling for gfx1101. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ 10 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ 10 warnings generated when compiling for gfx90a. [ 88%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_premulsum_f8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60300 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_premulsum_f8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_premulsum_f8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_premulsum_f8.cpp.o -c /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f64, ncclFuncReduceScatter, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f64, ncclFuncReduceScatter, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f64, ncclFuncReduceScatter, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_PreMulSum_f64, ncclFuncReduceScatter, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f64, ncclFuncReduceScatter, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f64, ncclFuncReduceScatter, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f64, ncclFuncReduceScatter, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f64, ncclFuncReduceScatter, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f64, ncclFuncReduceScatter, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f64, ncclFuncReduceScatter, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f64, ncclFuncReduceScatter, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f64, ncclFuncReduceScatter, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | 9 stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_PreMulSum_f64, ncclFuncReduceScatter, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ warnings generated when compiling for host. :667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f64, ncclFuncReduceScatter, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f64, ncclFuncReduceScatter, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f64, ncclFuncReduceScatter, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f64, ncclFuncReduceScatter, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f64, ncclFuncReduceScatter, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 9 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 9 warnings generated when compiling for gfx1102. 9 warnings generated when compiling for gfx1101. 9 warnings generated when compiling for gfx1201. 9 warnings generated when compiling for gfx1200. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from 10 warnings generated when compiling for gfx90a. /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ 10 warnings generated when compiling for gfx90a. [ 88%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_premulsum_u32.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60300 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_premulsum_u32.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_premulsum_u32.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_premulsum_u32.cpp.o -c /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptIn file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ r = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_PreMulSum_f8, ncclFuncReduceScatter, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f8, ncclFuncReduceScatter, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f8, ncclFuncReduceScatter, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f8, ncclFuncReduceScatter, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f8, ncclFuncReduceScatter, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f8, ncclFuncReduceScatter, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f8, ncclFuncReduceScatter, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f8, ncclFuncReduceScatter, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f8, ncclFuncReduceScatter, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f8, ncclFuncReduceScatter, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f8, ncclFuncReduceScatter, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f8, ncclFuncReduceScatter, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 9 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_PreMulSum_f8, ncclFuncReduceScatter, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), gIn file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ roup(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f8, ncclFuncReduceScatter, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f8, ncclFuncReduceScatter, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f8, ncclFuncReduceScatter, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f8, ncclFuncReduceScatter, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f8, ncclFuncReduceScatter, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ 9 warnings generated when compiling for gfx1101. 9 warnings generated when compiling for gfx1201. 9 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 9 warnings generated when compiling for gfx1200. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 9 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 10 warnings generated when compiling for gfx90a. /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_PreMulSum_u32, ncclFuncReduceScatter, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u32, ncclFuncReduceScatter, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u32, ncclFuncReduceScatter, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u32, ncclFuncReduceScatter, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from 10/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u32, ncclFuncReduceScatter, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ warnings generated when compiling for gfx90a. /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZIn file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u32, ncclFuncReduceScatter, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | E), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_PreMulSum_u32, ncclFuncReduceScatter, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u32, ncclFuncReduceScatter, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u32, ncclFuncReduceScatter, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 10 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u32, ncclFuncReduceScatter, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u32, ncclFuncReduceScatter, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u32, ncclFuncReduceScatter, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ [ 89%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_premulsum_u64.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60300 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_premulsum_u64.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_premulsum_u64.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_premulsum_u64.cpp.o -c /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u32, ncclFuncReduceScatter, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u32, ncclFuncReduceScatter, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u32, ncclFuncReduceScatter, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u32, ncclFuncReduceScatter, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u32, ncclFuncReduceScatter, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u32, ncclFuncReduceScatter, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 9 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 10 warnings generated when compiling for gfx90a. In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cppIn file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] :2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t dIn file included from ata1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint6/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 4_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u64, ncclFuncReduceScatter, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_PreMulSum_u64, ncclFuncReduceScatter, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_PreMulSum_u64, ncclFuncReduceScatter, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u64, ncclFuncReduceScatter, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u64, ncclFuncReduceScatter, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u64, ncclFuncReduceScatter, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u64, ncclFuncReduceScatter, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u64, ncclFuncReduceScatter, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u64, ncclFuncReduceScatter, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u64, ncclFuncReduceScatter, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u64, ncclFuncReduceScatter, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u64, ncclFuncReduceScatter, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u64, ncclFuncReduceScatter, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u64, ncclFuncReduceScatter, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u64, ncclFuncReduceScatter, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u64, ncclFuncReduceScatter, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ :15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u64, ncclFuncReduceScatter, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u64, ncclFuncReduceScatter, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 9 warnings generated when compiling for host. 9 warnings generated when compiling for gfx1200. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 9 warnings generated when compiling for gfx1101. 9 warnings generated when compiling for gfx1200. 9 warnings generated when compiling for gfx1102. 9 warnings generated when compiling for gfx1201. 9 warnings generated when compiling for gfx1102. 9 warnings generated when compiling for gfx1100. 9 warnings generated when compiling for gfx1101. 9 warnings generated when compiling for gfx1201. 9 warnings generated when compiling for gfx1100. [ 89%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_premulsum_u8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60300 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_premulsum_u8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_premulsum_u8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_premulsum_u8.cpp.o -c /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp 10 warnings generated when compiling for gfx90a. 10 warnings generated when compiling for gfx90a. [ 89%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_prod_bf16.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60300 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_prod_bf16.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_prod_bf16.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_prod_bf16.cpp.o -c /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:1: In file included from t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 9 warnings generated when compiling for gfx1200. 9 warnings generated when compiling for gfx1102. 9 warnings generated when compiling for gfx1101. 9 warnings generated when compiling for gfx1201. 9 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u8, ncclFuncReduceScatter, FuncPreMulSum, uint8_t, NCCIn file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ L_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_PreMulSum_u8, ncclFuncReduceScatter, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, datIn file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u8, ncclFuncReduceScatter, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ a2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u8, ncclFuncReduceScatter, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u8, ncclFuncReduceScatter, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u8, ncclFuncReduceScatter, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u8, ncclFuncReduceScatter, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u8, ncclFuncReduceScatter, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_PreMulSum_u8, ncclFuncReduceScatter, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), gr/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.houp(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u8, ncclFuncReduceScatter, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u8, ncclFuncReduceScatter, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 10 warnings generated when compiling for gfx90a. 9 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u8, ncclFuncReduceScatter, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u8, ncclFuncReduceScatter, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u8, ncclFuncReduceScatter, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u8, ncclFuncReduceScatter, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u8, ncclFuncReduceScatter, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 10 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u8, ncclFuncReduceScatter, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u8, ncclFuncReduceScatter, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ [ 89%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_prod_bf8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60300 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_prod_bf8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_prod_bf8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_prod_bf8.cpp.o -c /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_bf16, ncclFuncReduceScatter, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_bf16, ncclFuncReduceScatter, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_Prod_bf16, ncclFuncReduceScatter, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_bf16, ncclFuncReduceScatter, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_bf16, ncclFuncReduceScatter, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_bf16, ncclFuncReduceScatter, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_Prod_bf16, ncclFuncReduceScatter, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_bf16, ncclFuncReduceScatter, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_bf16, ncclFuncReduceScatter, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_bf16, ncclFuncReduceScatter, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_bf16, ncclFuncReduceScatter, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_bf16, ncclFuncReduceScatter, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_bf16, ncclFuncReduceScatter, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 9 warnings generated when compiling for host. /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_bf16, ncclFuncReduceScatter, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_bf16, ncclFuncReduceScatter, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_bf16, ncclFuncReduceScatter, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_bf16, ncclFuncReduceScatter, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>(/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads)In file included from ,/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp :t1i: dIn file included from In/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.hB:17l: oIn file included from c/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.hk:(11: tIn file included from h/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.hr:e12adI: dIn file included from x/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:.126x: In file included from )/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:,14 : In file included from g/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.hr:o37: uIn file included from p/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:(g14: r/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.ho:u46p:),13 : | warning: ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~unused function 'log2i' [-Wunused-function] | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 46 | s t668a | t i c lsotnegp Sliozge2(is(tleopnSgi zne)_ {= = | 0 ^~~~~ ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_bf16, ncclFuncReduceScatter, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ).run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t yIn file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ , head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ 10 warnings generated when compiling for gfx90a. 10 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ , flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_bf8, ncclFuncReduceScatter, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_bf8, ncclFuncReduceScatter, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_bf8, ncclFuncReduceScatter, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_bf8, ncclFuncReduceScatter, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 9 warnings generated when compiling for host. /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_Prod_bf8, ncclFuncReduceScatter, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_bf8, ncclFuncReduceScatter, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_bf8, ncclFuncReduceScatter, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_bf8, ncclFuncReduceScatter, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_bf8, ncclFuncReduceScatter, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_Prod_bf8, ncclFuncReduceScatter, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_bf8, ncclFuncReduceScatter, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_bf8, ncclFuncReduceScatter, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_bf8, ncclFuncReduceScatter, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ (stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_bf8, ncclFuncReduceScatter, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_bf8, ncclFuncReduceScatter, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_bf8, ncclFuncReduceScatter, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_bf8, ncclFuncReduceScatter, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_bf8, ncclFuncReduceScatter, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 9 warnings generated when compiling for gfx1201. 9 warnings generated when compiling for gfx1101. 9 warnings generated when compiling for gfx1102. 9 warnings generated when compiling for gfx1200. 9 warnings generated when compiling for gfx1100. 9 warnings generated when compiling for gfx1100. 9 warnings generated when compiling for gfx1102. 9 warnings generated when compiling for gfx1101. 9 warnings generated when compiling for gfx1200. 9 warnings generated when compiling for gfx1201. 10 warnings generated when compiling for gfx90a. 9 warnings generated when compiling for gfx1201. 10 warnings generated when compiling for gfx90a. 10 warnings generated when compiling for gfx90a. [ 90%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_prod_f16.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60300 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_prod_f16.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_prod_f16.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_prod_f16.cpp.o -c /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp 9 warnings generated when compiling for gfx1102. 9 warnings generated when compiling for gfx1100. 10 warnings generated when compiling for gfx90a. 9 warnings generated when compiling for gfx1101. [ 90%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_prod_f32.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60300 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_prod_f32.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_prod_f32.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_prod_f32.cpp.o -c /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp 9 warnings generated when compiling for gfx1200. [ 90%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_prod_f64.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60300 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_prod_f64.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_prod_f64.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_prod_f64.cpp.o -c /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uIn file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ int64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f16, ncclFuncReduceScatter, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(In file included from tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(n:thre79:5ads): note: , tidInin instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoLL128, 2>' requested here 79 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReducBlock(threadIdx.x), group(groupe), Scatter_RING_LL128_Prod_f16, ncclFuncReduceScatter, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f16, ncclFuncReduceScatter, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f16, ncclFuncReduceScatter, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f16, ncclFuncReduceScatter, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f16, ncclFuncReduceScatter, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f32, ncclFuncReduceScatter, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_Prod_f32, ncclFuncReduceScatter, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f16, ncclFuncReduceScatter, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f16, ncclFuncReduceScatter, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f16, ncclFuncReduceScatter, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f32, ncclFuncReduceScatter, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthrIn file included from eads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f16, ncclFuncReduceScatter, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f32, ncclFuncReduceScatter, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f16, ncclFuncReduceScatter, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f32, ncclFuncReduceScatter, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoLL128, 2>' requested here 79 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_Prod_f16, ncclFuncReduceScatter, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f16, ncclFuncReduceScatter, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_Prod_f32, ncclFuncReduceScatter, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f16, ncclFuncReduceScatter, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f16, ncclFuncReduceScatter, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f32, ncclFuncReduceScatter, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->re 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f16, ncclFuncReduceScatter, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ cvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f32, ncclFuncReduceScatter, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f32, ncclFuncReduceScatter, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f16, ncclFuncReduceScatter, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(t/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.hi:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f32, ncclFuncReduceScatter, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ d, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f32, ncclFuncReduceScatter, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 9 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f32, ncclFuncReduceScatter, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f16, ncclFuncReduceScatter, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ warnings generated when compiling for host. /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f32, ncclFuncReduceScatter, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f32, ncclFuncReduceScatter, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f64, ncclFuncReduceScatter, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f32, ncclFuncReduceScatter, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f32, ncclFuncReduceScatter, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f64, ncclFuncReduceScatter, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f64, ncclFuncReduceScatter, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_Prod_f64, ncclFuncReduceScatter, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f64, ncclFuncReduceScatter, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f32, ncclFuncReduceScatter, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f64, ncclFuncReduceScatter, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f32, ncclFuncReduceScatter, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f64, ncclFuncReduceScatter, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f64, ncclFuncReduceScatter, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f64, ncclFuncReduceScatter, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_Prod_f64, ncclFuncReduceScatter, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:7:1:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f64, ncclFuncReduceScatter, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f64, ncclFuncReduceScatter, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f64, ncclFuncReduceScatter, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f64, ncclFuncReduceScatter, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f64, ncclFuncReduceScatter, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f64, ncclFuncReduceScatter, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f64, ncclFuncReduceScatter, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f64, ncclFuncReduceScatter, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 9 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 10 warnings generated when compiling for gfx90a. 10 warnings generated when compiling for gfx90a. 9 warnings generated when compiling for gfx1201. 9 warnings generated when compiling for gfx1101. 9 warnings generated when compiling for gfx1100. 9 warnings generated when compiling for gfx1200. 9 warnings generated when compiling for gfx1102. [ 90%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_prod_f8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60300 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_prod_f8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_prod_f8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_prod_f8.cpp.o -c /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp 9 warnings generated when compiling for gfx1102. 9 warnings generated when compiling for gfx1101. 9 warnings generated when compiling for gfx1102. 9 warnings generated when compiling for gfx1200. 9 warnings generated when compiling for gfx1100. 9 warnings generated when compiling for gfx1100. 9 warnings generated when compiling for gfx1101. 9 warnings generated when compiling for gfx1101. 9 warnings generated when compiling for gfx1201. 9 warnings generated when compiling for gfx1100. 9 warnings generated when compiling for gfx1201. 9 warnings generated when compiling for gfx1200. 9 warnings generated when compiling for gfx1201. 9 warnings generated when compiling for gfx1102. 9 warnings generated when compiling for gfx1200. 10 warnings generated when compiling for gfx90a. 10 warnings generated when compiling for gfx90a. 10 warnings generated when compiling for gfx90a. [ 91%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_prod_u32.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60300 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_prod_u32.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_prod_u32.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_prod_u32.cpp.o -c /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp 10 warnings generated when compiling for gfx90a. 10 warnings generated when compiling for gfx90a. 10 warnings generated when compiling for gfx90a. [ 91%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_prod_u64.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60300 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_prod_u64.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_prod_u64.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_prod_u64.cpp.o -c /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp [ 91%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_prod_u8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60300 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_prod_u8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_prod_u8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_prod_u8.cpp.o -c /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f8, ncclFuncReduceScatter, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f8, ncclFuncReduceScatter, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_Prod_f8, ncclFuncReduceScatter, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f8, ncclFuncReduceScatter, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f8, ncclFuncReduceScatter, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f8, ncclFuncReduceScatter, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f8, ncclFuncReduceScatter, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(loIn file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f8, ncclFuncReduceScatter, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ng n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f8, ncclFuncReduceScatter, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f8, ncclFuncReduceScatter, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f8, ncclFuncReduceScatter, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f8, ncclFuncReduceScatter, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f8, ncclFuncReduceScatter, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f8, ncclFuncReduceScatter, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f8, ncclFuncReduceScatter, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_Prod_f8, ncclFuncReduceScatter, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f8, ncclFuncReduceScatter, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 9 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f8, ncclFuncReduceScatter, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ 13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | In file included from uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:2: In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_tIn file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u32, ncclFuncReduceScatter, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().rIn file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, un(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u32, ncclFuncReduceScatter, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u32, ncclFuncReduceScatter, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u32, ncclFuncReduceScatter, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u32, ncclFuncReduceScatter, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.b/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.huffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u32, ncclFuncReduceScatter, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u32, ncclFuncReduceScatter, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u32, ncclFuncReduceScatter, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_Prod_u32, ncclFuncReduceScatter, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_Prod_u32, ncclFuncReduceScatter, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u32, ncclFuncReduceScatter, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u32, ncclFuncReduceScatter, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndexIn file included from ); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u32, ncclFuncReduceScatter, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u32, ncclFuncReduceScatter, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u64, ncclFuncReduceScatter, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_Prod_u64, ncclFuncReduceScatter, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u32, ncclFuncReduceScatter, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u64, ncclFuncReduceScatter, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_Prod_u64, ncclFuncReduceScatter, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u64, ncclFuncReduceScatter, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_Prod_u8, ncclFuncReduceScatter, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u32, ncclFuncReduceScatter, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u32, ncclFuncReduceScatter, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u64, ncclFuncReduceScatter, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 9 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u8, ncclFuncReduceScatter, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u32, ncclFuncReduceScatter, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(th/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] readIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u64, ncclFuncReduceScatter, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u64, ncclFuncReduceScatter, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>(/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u64, ncclFuncReduceScatter, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ).run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u8, ncclFuncReduceScatter, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_Prod_u8, ncclFuncReduceScatter, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u8, ncclFuncReduceScatter, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, argsIn file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u8, ncclFuncReduceScatter, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u64, ncclFuncReduceScatter, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u64, ncclFuncReduceScatter, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u64, ncclFuncReduceScatter, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u64, ncclFuncReduceScatter, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 9 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u8, ncclFuncReduceScatter, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u8, ncclFuncReduceScatter, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u8, ncclFuncReduceScatter, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u8, ncclFuncReduceScatter, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclIn file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u64, ncclFuncReduceScatter, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ Shmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u8, ncclFuncReduceScatter, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u8, ncclFuncReduceScatter, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u64, ncclFuncReduceScatter, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.hIn file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ :667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u8, ncclFuncReduceScatter, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | sta/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u64, ncclFuncReduceScatter, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ tic long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h9:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u8, ncclFuncReduceScatter, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u8, ncclFuncReduceScatter, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ warnings generated when compiling for host. /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u8, ncclFuncReduceScatter, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u8, ncclFuncReduceScatter, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u64, ncclFuncReduceScatter, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u64, ncclFuncReduceScatter, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u8, ncclFuncReduceScatter, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 9 warnings generated when compiling for gfx1102. 9 warnings generated when compiling for gfx1200. 9 warnings generated when compiling for gfx1101. 9 warnings generated when compiling for gfx1201. 9 warnings generated when compiling for gfx1101. 9 warnings generated when compiling for gfx1100. 9 warnings generated when compiling for gfx1200. 9 warnings generated when compiling for gfx1102. 9 warnings generated when compiling for gfx1100. 9 warnings generated when compiling for gfx1201. 10 warnings generated when compiling for gfx90a. 10 warnings generated when compiling for gfx90a. 10 warnings generated when compiling for gfx90a. 10 warnings generated when compiling for gfx90a. [ 91%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sum_bf16.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60300 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sum_bf16.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sum_bf16.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sum_bf16.cpp.o -c /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp 9 warnings generated when compiling for gfx1200. [ 92%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sum_bf8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60300 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sum_bf8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sum_bf8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sum_bf8.cpp.o -c /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp 9 warnings generated when compiling for gfx1102. 9 warnings generated when compiling for gfx1100. 9 warnings generated when compiling for gfx1101. 9 warnings generated when compiling for gfx1201. 10 warnings generated when compiling for gfx90a. 10 warnings generated when compiling for gfx90a. [ 92%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sum_f16.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60300 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sum_f16.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sum_f16.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sum_f16.cpp.o -c /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp 10 warnings generated when compiling for gfx90a. 10 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ 9 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable]In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 9 warnings generated when compiling for gfx1101. 9 warnings generated when compiling for gfx1200. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ 9 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ 9 warnings generated when compiling for gfx1201. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ [ 92%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sum_f32.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60300 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sum_f32.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sum_f32.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sum_f32.cpp.o -c /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_bf16, ncclFuncReduceScatter, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here In file included from 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_bf16, ncclFuncReduceScatter, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_bf16, ncclFuncReduceScatter, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_bf16, ncclFuncReduceScatter, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_Sum_bf16, ncclFuncReduceScatter, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_bf8, ncclFuncReduceScatter, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tiIn file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/Nd(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_bf16, ncclFuncReduceScatter, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ CCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_bf8, ncclFuncReduceScatter, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_bf16, ncclFuncReduceScatter, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SI/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_bf16, ncclFuncReduceScatter, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ MPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_bf8, ncclFuncReduceScatter, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_Sum_bf8, ncclFuncReduceScatter, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_bf8, ncclFuncReduceScatter, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_bf16, ncclFuncReduceScatter, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) :/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_bf16, ncclFuncReduceScatter, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_bf8, ncclFuncReduceScatter, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_bf16, ncclFuncReduceScatter, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_bf8, ncclFuncReduceScatter, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_Sum_bf8, ncclFuncReduceScatter, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_bf8, ncclFuncReduceScatter, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_bf16, ncclFuncReduceScatter, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSiIn file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] ze(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_bf8, ncclFuncReduceScatter, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_bf8, ncclFuncReduceScatter, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_Sum_bf16, ncclFuncReduceScatter, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from :667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), ti667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group dInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_bf8, ncclFuncReduceScatter, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.xin instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here), g roup(gr 65 | oup), runRing(args); | ^ | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here ? ncclShmem.comm.buf fSizes[NCC203 | RunL_PROTO_SIMPLE]/NWorkElement().run(we); | ^ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 34 | pr 7 | DEFims(tid, nthIrNeads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_bf8, ncclFuncReduceScatter, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ E_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_bf16, ncclFuncReduceScatter, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_bf16, ncclFuncReduceScatter, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_bf16, ncclFuncReduceScatter, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_bf8, ncclFuncReduceScatter, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, n9 warnings generated when compiling for host. threads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_bf8, ncclFuncReduceScatter, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_bf8, ncclFuncReduceScatter, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_bf16, ncclFuncReduceScatter, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_bf8, ncclFuncReduceScatter, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_bf8, ncclFuncReduceScatter, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 9 warnings generated when compiling for host. /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_bf16, ncclFuncReduceScatter, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cppIn file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ :2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f16, ncclFuncReduceScatter, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f16, ncclFuncReduceScatter, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f16, ncclFuncReduceScatter, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoLL128, 2>' requested here 79 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_Sum_f16, ncclFuncReduceScatter, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f16, ncclFuncReduceScatter, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoLL128, 2>' requested here 79 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_Sum_f16, ncclFuncReduceScatter, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f16, ncclFuncReduceScatter, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f16, ncclFuncReduceScatter, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f16, ncclFuncReduceScatter, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f16, ncclFuncReduceScatter, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f16, ncclFuncReduceScatter, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f16, ncclFuncReduceScatter, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f16, ncclFuncReduceScatter, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f16, ncclFuncReduceScatter, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f16, ncclFuncReduceScatter, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 9In file included from warnings generated when compiling for host. /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f16, ncclFuncReduceScatter, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f16, ncclFuncReduceScatter, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f16, ncclFuncReduceScatter, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1In file included from , flag1, dat/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ a2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_Sum_f32, ncclFuncReduceScatter, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f32, ncclFuncReduceScatter, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f32, ncclFuncReduceScatter, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f32, ncclFuncReduceScatter, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_Sum_f32, ncclFuncReduceScatter, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f32, ncclFuncReduceScatter, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f32, ncclFuncReduceScatter, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f32, ncclFuncReduceScatter, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f32, ncclFuncReduceScatter, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f32, ncclFuncReduceScatter, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp=:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ = 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f32, ncclFuncReduceScatter, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f32, ncclFuncReduceScatter, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f32, ncclFuncReduceScatter, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from 9 warnings generated when compiling for host. /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f32, ncclFuncReduceScatter, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f32, ncclFuncReduceScatter, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f32, ncclFuncReduceScatter, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f32, ncclFuncReduceScatter, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f32, ncclFuncReduceScatter, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 9 warnings generated when compiling for gfx1100. 9 warnings generated when compiling for gfx1201. 9 warnings generated when compiling for gfx1200. 9 warnings generated when compiling for gfx1101. 9 warnings generated when compiling for gfx1102. 9 warnings generated when compiling for gfx1201. 9 warnings generated when compiling for gfx1100. 9 warnings generated when compiling for gfx1101. 9 warnings generated when compiling for gfx1102. 9 warnings generated when compiling for gfx1200. 10 warnings generated when compiling for gfx90a. 10 warnings generated when compiling for gfx90a. [ 92%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sum_f64.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60300 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sum_f64.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sum_f64.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sum_f64.cpp.o -c /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp 10 warnings generated when compiling for gfx90a. 10 warnings generated when compiling for gfx90a. [ 93%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sum_f8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60300 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sum_f8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sum_f8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sum_f8.cpp.o -c /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp 9 warnings generated when compiling for gfx1100. 9 warnings generated when compiling for gfx1201. 9 warnings generated when compiling for gfx1101. 9 warnings generated when compiling for gfx1200. 9 warnings generated when compiling for gfx1102. 10 warnings generated when compiling for gfx90a. 10 warnings generated when compiling for gfx90a. 10 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ 10 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ [ 93%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sum_u32.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60300 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sum_u32.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sum_u32.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sum_u32.cpp.o -c /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18:In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uiIn file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ nt64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, datIn file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ a2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uiIn file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ nt32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f64, ncclFuncReduceScatter, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 9 warnings generated when compiling for gfx1200. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f64, ncclFuncReduceScatter, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f64, ncclFuncReduceScatter, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f64, ncclFuncReduceScatter, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.hIn file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEP:506:S/size29:of(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here warning: 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f64, ncclFuncReduceScatter, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_Sum_f64, ncclFuncReduceScatter, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f64, ncclFuncReduceScatter, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args-In file included from >sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_Sum_f64, ncclFuncReduceScatter, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f64, ncclFuncReduceScatter, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f64, ncclFuncReduceScatter, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f64, ncclFuncReduceScatter, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f64, ncclFuncReduceScatter, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f64, ncclFuncReduceScatter, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f64, ncclFuncReduceScatter, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f64, ncclFuncReduceScatter, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 9/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h warnings generated when compiling for gfx1101. :667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f64, ncclFuncReduceScatter, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f64, ncclFuncReduceScatter, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:9 warnings generated when compiling for gfx1102. 11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 9 warnings generated when compiling for gfx1201. /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f64, ncclFuncReduceScatter, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 9 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 9 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f8, ncclFuncReduceScatter, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_Sum_f8, ncclFuncReduceScatter, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ [ 93%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sum_u64.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60300 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sum_u64.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sum_u64.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sum_u64.cpp.o -c /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f8, ncclFuncReduceScatter, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f8, ncclFuncReduceScatter, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f8, ncclFuncReduceScatter, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f8, ncclFuncReduceScatter, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_Sum_f8, ncclFuncReduceScatter, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f8, ncclFuncReduceScatter, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f8, ncclFuncReduceScatter, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f8, ncclFuncReduceScatter, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f8, ncclFuncReduceScatter, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f8, ncclFuncReduceScatter, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f8, ncclFuncReduceScatter, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f8, ncclFuncReduceScatter, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f8, ncclFuncReduceScatter, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f8, ncclFuncReduceScatter, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f8, ncclFuncReduceScatter, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 9 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f8, ncclFuncReduceScatter, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u32, ncclFuncReduceScatter, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u32, ncclFuncReduceScatter, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_Sum_u32, ncclFuncReduceScatter, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u32, ncclFuncReduceScatter, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u32, ncclFuncReduceScatter, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u32, ncclFuncReduceScatter, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u32, ncclFuncReduceScatter, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_Sum_u32, ncclFuncReduceScatter, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u32, ncclFuncReduceScatter, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u32, ncclFuncReduceScatter, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 9 warnings generated when compiling for host. /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u32, ncclFuncReduceScatter, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u32, ncclFuncReduceScatter, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u32, ncclFuncReduceScatter, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u32, ncclFuncReduceScatter, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u32, ncclFuncReduceScatter, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u32, ncclFuncReduceScatter, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u32, ncclFuncReduceScatter, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u32, ncclFuncReduceScatter, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : steIn file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? npSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u64, ncclFuncReduceScatter, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ cclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u64, ncclFuncReduceScatter, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u64, ncclFuncReduceScatter, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 9 warnings generated when compiling for gfx1100. /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u64, ncclFuncReduceScatter, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u64, ncclFuncReduceScatter, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 9 warnings generated when compiling for gfx1101. /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u64, ncclFuncReduceScatter, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u64, ncclFuncReduceScatter, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 9 warnings generated when compiling for gfx1102. /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_Sum_u64, ncclFuncReduceScatter, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u64, ncclFuncReduceScatter, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u64, ncclFuncReduceScatter, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 9 warnings generated when compiling for gfx1200. 9 warnings generated when compiling for gfx1201. /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u64, ncclFuncReduceScatter, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_Sum_u64, ncclFuncReduceScatter, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u64, ncclFuncReduceScatter, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u64, ncclFuncReduceScatter, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u64, ncclFuncReduceScatter, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u64, ncclFuncReduceScatter, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u64, ncclFuncReduceScatter, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 9 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u64, ncclFuncReduceScatter, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 10 warningIn file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ s generated when compiling for gfx90a. 10 warnings generated when compiling for gfx90a. [ 93%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sum_u8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60300 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sum_u8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sum_u8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sum_u8.cpp.o -c /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp 9 warnings generated when compiling for gfx1201. 9 warnings generated when compiling for gfx1102. 9 warnings generated when compiling for gfx1100. 9 warnings generated when compiling for gfx1200. 9 warnings generated when compiling for gfx1101. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ 10 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ 10 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ [ 94%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60300 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp.o -c /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, fIn file included from lag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 9 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 9 warnings generated when compiling for gfx1101. 9 warnings generated when compiling for gfx1201. 9 warnings generated when compiling for gfx1102. /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_Sum_u8, ncclFuncReduceScatter, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u8, ncclFuncReduceScatter, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(t9 warnings generated when compiling for gfx1200. id), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u8, ncclFuncReduceScatter, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u8, ncclFuncReduceScatter, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u8, ncclFuncReduceScatter, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u8, ncclFuncReduceScatter, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u8, ncclFuncReduceScatter, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreadIn file included from s), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp.:2: comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u8, ncclFuncReduceScatter, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u8, ncclFuncReduceScatter, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u8, ncclFuncReduceScatter, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_Sum_u8, ncclFuncReduceScatter, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u8, ncclFuncReduceScatter, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u8, ncclFuncReduceScatter, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u8, ncclFuncReduceScatter, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u8, ncclFuncReduceScatter, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 9 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 10 warnings generated when compiling for gfx90a. /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u8, ncclFuncReduceScatter, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u8, ncclFuncReduceScatter, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u8, ncclFuncReduceScatter, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 10 warnings generated when compiling for gfx90a. 10 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 10 warnings generated when compiling for gfx90a. [ 94%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60300 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp.o -c /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ 9 warnings generated when compiling for gfx1201. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 9 warnings generated when compiling for gfx1200. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 9 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 9 warnings generated when compiling for gfx1102. 9 warnings generated when compiling for gfx1101. [ 94%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60300 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp.o -c /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i32, ncclFuncReduceScatter, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i32, ncclFuncReduceScatter, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i32, ncclFuncReduceScatter, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i32, ncclFuncReduceScatter, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_SumPostDiv_i32, ncclFuncReduceScatter, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i32, ncclFuncReduceScatter, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i32, ncclFuncReduceScatter, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_SumPostDiv_i32, ncclFuncReduceScatter, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i32, ncclFuncReduceScatter, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i32, ncclFuncReduceScatter, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i32, ncclFuncReduceScatter, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 9 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i32, ncclFuncReduceScatter, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i32, ncclFuncReduceScatter, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i32, ncclFuncReduceScatter, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i32, ncclFuncReduceScatter, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i32, ncclFuncReduceScatter, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i32, ncclFuncReduceScatter, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i32, ncclFuncReduceScatter, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flagIn file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i64, ncclFuncReduceScatter, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from 9 warnings generated when compiling for gfx1102. /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i64, ncclFuncReduceScatter, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i64, ncclFuncReduceScatter, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i64, ncclFuncReduceScatter, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.hIn file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i64, ncclFuncReduceScatter, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i64, ncclFuncReduceScatter, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 9 warnings generated when compiling for gfx1100. /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_SumPostDiv_i64, ncclFuncReduceScatter, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_SumPostDiv_i64, ncclFuncReduceScatter, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i64, ncclFuncReduceScatter, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock:(667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i64, ncclFuncReduceScatter, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i64, ncclFuncReduceScatter, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i64, ncclFuncReduceScatter, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i64, ncclFuncReduceScatter, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i64, ncclFuncReduceScatter, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 9 warnings generated when compiling for gfx1101. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i64, ncclFuncReduceScatter, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i64, ncclFuncReduceScatter, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 9 warnings generated when compiling for gfx1200. /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.hIn file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ :667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i64, ncclFuncReduceScatter, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 9 warnings generated when compiling for gfx1201. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 9 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i64, ncclFuncReduceScatter, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 10 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 10 warnings generated when compiling for gfx90a. [ 94%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60300 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp.o -c /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i8, ncclFuncReduceScatter, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i8, ncclFuncReduceScatter, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_SumPostDiv_i8, ncclFuncReduceScatter, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i8, ncclFuncReduceScatter, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i8, ncclFuncReduceScatter, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i8, ncclFuncReduceScatter, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i8, ncclFuncReduceScatter, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_SumPostDiv_i8, ncclFuncReduceScatter, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ (group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i8, ncclFuncReduceScatter, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.hIn file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i8, ncclFuncReduceScatter, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i8, ncclFuncReduceScatter, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ , Proto, COLL_UNROLL>(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i8, ncclFuncReduceScatter, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i8, ncclFuncReduceScatter, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i8, ncclFuncReduceScatter, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 9 warnings generated when compiling for host/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we. ); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i8, ncclFuncReduceScatter, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i8, ncclFuncReduceScatter, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i8, ncclFuncReduceScatter, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i8, ncclFuncReduceScatter, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 9 warnings generated when compiling for gfx1201. 9 warnings generated when compiling for gfx1100. 9 warnings generated when compiling for gfx1102. 9 warnings generated when compiling for gfx1200. 9 warnings generated when compiling for gfx1101. 10 warnings generated when compiling for gfx90a. 10 warnings generated when compiling for gfx90a. [ 95%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60300 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp.o -c /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ 9 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 9 warnings generated when compiling for gfx1201. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 9 warnings generated when compiling for gfx1101. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 9 warnings generated when compiling for gfx1200. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 9 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u32, ncclFuncReduceScatter, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_SumPostDiv_u32, ncclFuncReduceScatter, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ 10 warnings generated when compiling for gfx90a. 10 warnings generated when compiling for gfx90a. /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u32, ncclFuncReduceScatter, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u32, ncclFuncReduceScatter, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u32, ncclFuncReduceScatter, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_SumPostDiv_u32, ncclFuncReduceScatter, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u32, ncclFuncReduceScatter, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u32, ncclFuncReduceScatter, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ [ 95%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp.o /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u32, ncclFuncReduceScatter, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60300 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp.o -c /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:1: In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u32, ncclFuncReduceScatter, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 9 warnings generated when compiling for host. /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u32, ncclFuncReduceScatter, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u32, ncclFuncReduceScatter, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u32, ncclFuncReduceScatter, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u32, ncclFuncReduceScatter, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u32, ncclFuncReduceScatter, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u32, ncclFuncReduceScatter, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u32, ncclFuncReduceScatter, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u32, ncclFuncReduceScatter, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t dIn file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ ata1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u64, ncclFuncReduceScatter, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u64, ncclFuncReduceScatter, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 9 warnings generated when compiling for gfx1201. /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_SumPostDiv_u64, ncclFuncReduceScatter, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ :667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u64, ncclFuncReduceScatter, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.hIn file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u64, ncclFuncReduceScatter, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_SumPostDiv_u64, ncclFuncReduceScatter, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u64, ncclFuncReduceScatter, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u64, ncclFuncReduceScatter, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u64, ncclFuncReduceScatter, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u64, ncclFuncReduceScatter, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u64, ncclFuncReduceScatter, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(gr9oup), | ^~~~~~~~~~~ warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u64, ncclFuncReduceScatter, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u64, ncclFuncReduceScatter, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u64, ncclFuncReduceScatter, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 9 warnings generated when compiling for gfx1200. /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u64, ncclFuncReduceScatter, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u64, ncclFuncReduceScatter, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 9 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ 9 warnings generated when compiling for gfx1101. /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.hIn file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ :667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u64, ncclFuncReduceScatter, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u64, ncclFuncReduceScatter, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ 9 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 10 warnings generated when compiling for gfx90a. 10 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ [ 95%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_sum_bf16.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60300 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_sum_bf16.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_sum_bf16.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_sum_bf16.cpp.o -c /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u8, ncclFuncReduceScatter, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u8, ncclFuncReduceScatter, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u8, ncclFuncReduceScatter, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u8, ncclFuncReduceScatter, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_SumPostDiv_u8, ncclFuncReduceScatter, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u8, ncclFuncReduceScatter, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u8, ncclFuncReduceScatter, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u8, ncclFuncReduceScatter, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_SumPostDiv_u8, ncclFuncReduceScatter, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u8, ncclFuncReduceScatter, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u8, ncclFuncReduceScatter, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u8, ncclFuncReduceScatter, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u8, ncclFuncReduceScatter, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u8, ncclFuncReduceScatter, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u8, ncclFuncReduceScatter, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u8, ncclFuncReduceScatter, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u8, ncclFuncReduceScatter, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u8, ncclFuncReduceScatter, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 9 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 9 warnings generated when compiling for gfx1101. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 9 warnings generated when compiling for gfx1201. 9 warnings generated when compiling for gfx1102. 9 warnings generated when compiling for gfx1200. 9 warnings generated when compiling for gfx1100. 10 warnings generated when compiling for gfx90a. 10 warnings generated when compiling for gfx90a. [ 95%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_sum_bf8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60300 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_sum_bf8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_sum_bf8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_sum_bf8.cpp.o -c /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, dat/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:2: In file included from a2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 9 warnings generated when compiling for gfx1200. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 9 warnings generated when compiling for gfx1101. 9 warnings generated when compiling for gfx1102. 9 warnings generated when compiling for gfx1201. 9 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_bf16, ncclFuncReduce, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 77 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_Sum_bf16, ncclFuncReduce, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_bf16, ncclFuncReduce, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 77 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(ste_Sum_bf16, ncclFuncReduce, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ pSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_bf16, ncclFuncReduce, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_bf16, ncclFuncReduce, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_bf16, ncclFuncReduce, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_bf16, ncclFuncReduce, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_bf16, ncclFuncReduce, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_bf16, ncclFuncReduce, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_bf16, ncclFuncReduce, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_bf16, ncclFuncReduce, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 10 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_bf16, ncclFuncReduce, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 10 warnings generated when compiling for gfx90a. /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_bf16, ncclFuncReduce, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_bf16, ncclFuncReduce, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_bf16, ncclFuncReduce, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_bf16, ncclFuncReduce, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.hIn file included from :667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_bf16, ncclFuncReduce, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ [ 96%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_sum_f16.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60300 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_sum_f16.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_sum_f16.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_sum_f16.cpp.o -c /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 9 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | flag2; ^~~ | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_bf8, ncclFuncReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_bf8, ncclFuncReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 77 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_Sum_bf8, ncclFuncReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_bf8, ncclFuncReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreaIn file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here d s203( | n th r e ad s ) , RtuindWIonrBklEolcekm(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~e n t| < tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_F n, T, RedOp, Algo, Proto, C668O | L L _ U NsRteOpLSLi>z(e)(.srtuenp(Swiez)e;_ =| = ^ 0 ? ncclShmem. comm.buffSize/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpps:[7N:C1C:L _note: Pin instantiation of member function 'RunWork, 1, 2, 2>::run' requested hereR OTO_SIMPLE]/NCCL_ S7T | EDPES/FsIiNzEe_onfc(cTl)D e:v Fsutnecp(SRiezdeu_c)e _{R I N| G ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~_ S I| M group(groupP LE_Sum_bf8, ncclFuncReduce, FuncSum, rccl_bfloat8, NCCL_ALG/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7O:_ Rnote: INin instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested hereG , NCCL_PROTO_SIMPLE) | ^ 33 | prim/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.hs:(406t:i52d:, note: nexpanded from macro 'DEFINE_ncclDevFunc't hreads, &rin g406- | > p r e vR,u n&Wroirnkg<-c>onlelx,t ,t ya,r rgesdo-p>n,d baulfgfo,, parrogtso-,> r2e>c(v)b.urfufn,( &anrcgcsl-S>hrmeedmO.pwAorrgk,) ;0 ,\ a r| g ^s ->connIndex, ar/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.hg:667s:-15>:c onote: nfield 'nthreads' will be initialized after field 'tidInBlock'n Index); | ^ 667 | t/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.hi:d63(:t5i:d )note: ,in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here nthreads( n63t | h r eraudnsR)i, ntgi((garrogusp));, | | ^ ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize'/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h :203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 667 | t203i | d ( t i d ) , nRtuhnrWeoardksE(lnethmreenatd)(,) . r| u ^~~~~~~~~~~n (we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_bf8, ncclFuncReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_bf8, ncclFuncReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_bf8, ncclFuncReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 77 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_Sum_bf8, ncclFuncReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_bf8, ncclFuncReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_bf8, ncclFuncReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_bf8, ncclFuncReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_bf8, ncclFuncReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_bf8, ncclFuncReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_bf8, ncclFuncReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_bf8, ncclFuncReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_bf8, ncclFuncReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_bf8, ncclFuncReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 9 warnings generated when compiling for host. 9 warnings generated when compiling for gfx1200. 9 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ 9 warnings generated when compiling for gfx1201. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ 9 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ 9 warnings generated when compiling for gfx1101. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 10 warnings generated when compiling for gfx90a. 10 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ [ 96%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_sum_f32.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60300 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_sum_f32.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_sum_f32.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_sum_f32.cpp.o -c /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f16, ncclFuncReduce, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f16, ncclFuncReduce, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f16, ncclFuncReduce, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f16, ncclFuncReduce, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoLL128, 2>' requested here 77 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_Sum_f16, ncclFuncReduce, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f16, ncclFuncReduce, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f16, ncclFuncReduce, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f16, ncclFuncReduce, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoLL128, 2>' requested here 77 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_Sum_f16, ncclFuncReduce, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f16, ncclFuncReduce, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f16, ncclFuncReduce, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvIn file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cppbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f16, ncclFuncReduce, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f16, ncclFuncReduce, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f16, ncclFuncReduce, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f16, ncclFuncReduce, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 9 warnings generated when compiling for host. /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.hIn file included from :667:15: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cppwarning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepS:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:ize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(ti17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ d, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f16, ncclFuncReduce, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f16, ncclFuncReduce, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f16, ncclFuncReduce, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 9 warnings generated when compiling for gfx1201. 9 warnings generated when compiling for gfx1101. 9 warnings generated when compiling for gfx1100. 9 warnings generated when compiling for gfx1102. 9 warnings generated when compiling for gfx1200. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ 10 warnings generated when compiling for gfx90a. 10 warnings generated when compiling for gfx90a. [ 96%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_sum_f64.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60300 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_sum_f64.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_sum_f64.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_sum_f64.cpp.o -c /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 77 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_Sum_f32, ncclFuncReduce, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f32, ncclFuncReduce, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f32, ncclFuncReduce, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f32, ncclFuncReduce, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 77 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_Sum_f32, ncclFuncReduce, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f32, ncclFuncReduce, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f32, ncclFuncReduce, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f32, ncclFuncReduce, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f32, ncclFuncReduce, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f32, ncclFuncReduce, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f32, ncclFuncReduce, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f32, ncclFuncReduce, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f32, ncclFuncReduce, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f32, ncclFuncReduce, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f32, ncclFuncReduce, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCLIn file included from _PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f32, ncclFuncReduce, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f32, ncclFuncReduce, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f32, ncclFuncReduce, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 9 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 9 warnings generated when compiling for gfx1201. 9 warnings generated when compiling for gfx1100. 9 warnings generated when compiling for gfx1101. 9 warnings generated when compiling for gfx1200. 9 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ 10 warnings generated when compiling for gfx90a. 10 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ [ 96%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_sum_f8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60300 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_sum_f8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_sum_f8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_sum_f8.cpp.o -c /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 77 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_Sum_f64, ncclFuncReduce, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f64, ncclFuncReduce, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f64, ncclFuncReduce, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f64, ncclFuncReduce, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 10 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f64, ncclFuncReduce, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 10 warnings generated when compiling for gfx90a. /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f64, ncclFuncReduce, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f64, ncclFuncReduce, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f64, ncclFuncReduce, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 77 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_Sum_f64, ncclFuncReduce, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f64, ncclFuncReduce, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f64, ncclFuncReduce, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f64, ncclFuncReduce, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f64, ncclFuncReduce, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f64, ncclFuncReduce, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f64, ncclFuncReduce, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f64, ncclFuncReduce, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f64, ncclFuncReduce, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f64, ncclFuncReduce, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 9 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 9 warnings generated when compiling for gfx1100. 9 warnings generated when compiling for gfx1200. 9 warnings generated when compiling for gfx1102. 9 warnings generated when compiling for gfx1201. 9 warnings generated when compiling for gfx1101. 9 warnings generated when compiling for gfx1200. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ 10 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ 9 warnings generated when compiling for gfx1100. 9 warnings generated when compiling for gfx1101. 9 warnings generated when compiling for gfx1201. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, 9 warnings generated when compiling for gfx1102. mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ 10 warnings generated when compiling for gfx90a. [ 97%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_sum_u32.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60300 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_sum_u32.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_sum_u32.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_sum_u32.cpp.o -c /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp [ 97%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_sum_u64.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60300 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_sum_u64.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_sum_u64.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_sum_u64.cpp.o -c /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:2_t data1, flag1, data2, flag2; | ^~~~~ : In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f8, ncclFuncReduce, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f8, ncclFuncReduce, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 77 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_Sum_f8, ncclFuncReduce, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f8, ncclFuncReduce, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f8, ncclFuncReduce, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f8, ncclFuncReduce, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElementprev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 77 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_Sum_f8, ncclFuncReduce, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ OLL_UNROLL>().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f8, ncclFuncReduce, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f8, ncclFuncReduce, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f8, ncclFuncReduce, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f8, ncclFuncReduce, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f8, ncclFuncReduce, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBl/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.hock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f8, ncclFuncReduce, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f8, ncclFuncReduce, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f8, ncclFuncReduce, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f8, ncclFuncReduce, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRingIn file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ (args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f8, ncclFuncReduce, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f8, ncclFuncReduce, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 9 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 9 warnings generated when compiling for gfx1102. 9 warnings generated when compiling for gfx1101. 9 warnings generated when compiling for gfx1100. 9 warnings generated when compiling for gfx1201. 9 warnings generated when compiling for gfx1200. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ 10 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ 10 warnings generated when compiling for gfx90a. [ 97%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_sum_u8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60300 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_sum_u8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_sum_u8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_sum_u8.cpp.o -c /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flIn file included from ag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:2: In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u32, ncclFuncReduce, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u32, ncclFuncReduce, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 77 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_Sum_u32, ncclFuncReduce, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 77 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_Sum_u32, ncclFuncReduce, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u32, ncclFuncReduce, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u32, ncclFuncReduce, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u32, ncclFuncReduce, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u32, ncclFuncReduce, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u32, ncclFuncReduce, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u32, ncclFuncReduce, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u32, ncclFuncReduce, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u64, ncclFuncReduce, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u32, ncclFuncReduce, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u32, ncclFuncReduce, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 77 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_Sum_u64, ncclFuncReduce, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u64, ncclFuncReduce, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u32, ncclFuncReduce, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u64, ncclFuncReduce, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u32, ncclFuncReduce, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u32, ncclFuncReduce, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u64, ncclFuncReduce, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u64, ncclFuncReduce, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u32, ncclFuncReduce, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u64, ncclFuncReduce, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 77 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_Sum_u64, ncclFuncReduce, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u32, ncclFuncReduce, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u64, ncclFuncReduce, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u64, ncclFuncReduce, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u64, ncclFuncReduce, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u64, ncclFuncReduce, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u64, ncclFuncReduce, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u64, ncclFuncReduce, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stIn file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u64, ncclFuncReduce, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ e/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ pSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u64, ncclFuncReduce, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 9 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u64, ncclFuncReduce, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 9 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u64, ncclFuncReduce, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ : In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u8, ncclFuncReduce, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 77 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_Sum_u8, ncclFuncReduce, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u8, ncclFuncReduce, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u8, ncclFuncReduce, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u8, ncclFuncReduce, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u8, ncclFuncReduce, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u8, ncclFuncReduce, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u8, ncclFuncReduce, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u8, ncclFuncReduce, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), groIn file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u8, ncclFuncReduce, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667up(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, :nt60h:r eanote: dsfield 'group' will be initialized after field 'stepSize', &ring->prev, &ring->next, args->sen d667b | uf f , atrigds(-t>irde)c,v bnutfhfr,e aadrsg(sn-t>hrerdeOapdAsr)g,, t0i,d IanrgBsl-o>ccko(ntnhIrnedaedxI,d xa.rxg)s,- >gcornonuIpn(dgexr)o;u p | ) ^, | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 77 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_Sum_u8, ncclFuncReduce, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u8, ncclFuncReduce, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h7 | :D667E:F15I:N Ewarning: _initializer order does not match the declaration order [-Wreorder-ctor]nc clDevFunc(R e667d | uc e _ RtiIdNG(_tSiId)MP,L Ent_Shrume_aud8s(,n tnhccrelaFdusn)cR,e tduicdeI,n FBlunoccSku(tmh,r euaidntI8d_x.tx, ),NC CgrL_ouApL(GOg_roRuIpN)G,, N| CC ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~L _ | PR tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_O TO_SIM PL668E | ) | ^s tepSiz/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.he:409(:s52t:e pnote: Sexpanded from macro 'DEFINE_ncclDevFunc'i ze_ = = 4090 | ? ncRuclnSWohmrekm<.coclomlm, .btyuf,f rSiedzeosp[C,L _aPlRgOTo,O _SprIoMtPoL,E ]/4N>C(C)L._rSuTnE(P&Sn/cscilzSehomfe(mT.)w o:r ks)t;e p\S i z| e ^_ ) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ :33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u8, ncclFuncReduce, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 9 warnings generated when compiling for host. /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u8, ncclFuncReduce, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_nc) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u8, ncclFuncReduce, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ clDevFunc(Reduce_RING_SIMPLE_Sum_u8, ncclFuncReduce, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u8, ncclFuncReduce, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 9 warnings generated when compiling for gfx1101. 9 warnings generated when compiling for gfx1100. 9 warnings generated when compiling for gfx1101. 9 warnings generated when compiling for gfx1201. 9 warnings generated when compiling for gfx1200. 9 warnings generated when compiling for gfx1102. 9 warnings generated when compiling for gfx1100. 9 warnings generated when compiling for gfx1102. 9 warnings generated when compiling for gfx1200. 9 warnings generated when compiling for gfx1201. 10 warnings generated when compiling for gfx90a. 10 warnings generated when compiling for gfx90a. 1010 warnings generated when compiling for gfx90a. warnings generated when compiling for gfx90a. [ 98%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_sumpostdiv_i64.cpp.o [ 98%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_sumpostdiv_i32.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60300 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_sumpostdiv_i32.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_sumpostdiv_i32.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_sumpostdiv_i32.cpp.o -c /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60300 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_sumpostdiv_i64.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_sumpostdiv_i64.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_sumpostdiv_i64.cpp.o -c /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp 10 warnings generated when compiling for gfx90a. 10 warnings generated when compiling for gfx90a. 9 warnings generated when compiling for gfx1100. 9 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ 9 warnings generated when compiling for gfx1101. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ 9 warnings generated when compiling for gfx1201. 9 warnings generated when compiling for gfx1102. 9 warnings generated when compiling for gfx1102. 9 warnings generated when compiling for gfx1200. 9 warnings generated when compiling for gfx1101. 9 warnings generated when compiling for gfx1201. 9[ 98%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_sumpostdiv_i8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60300 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_sumpostdiv_i8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_sumpostdiv_i8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_sumpostdiv_i8.cpp.o -c /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp warnings generated when compiling for gfx1200. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t daIn file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ ta1,In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 10 warnings generated when compiling for gfx90a. 10 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i64, ncclFuncReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i32, ncclFuncReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp([ 98%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_sumpostdiv_u32.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60300 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_sumpostdiv_u32.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_sumpostdiv_u32.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_sumpostdiv_u32.cpp.o -c /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 77 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_SumPostDiv_i32, ncclFuncReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i32, ncclFuncReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i64, ncclFuncReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i64, ncclFuncReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 77 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_SumPostDiv_i32, ncclFuncReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 77 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_SumPostDiv_i64, ncclFuncReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i32, ncclFuncReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i32, ncclFuncReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i32, ncclFuncReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i32, ncclFuncReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i32, ncclFuncReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i64, ncclFuncReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i64, ncclFuncReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 77 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_SumPostDiv_i64, ncclFuncReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i32, ncclFuncReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i64, ncclFuncReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i32, ncclFuncReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i64, ncclFuncReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCIn file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(L_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i64, ncclFuncReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 9group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i64, ncclFuncReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ warnings generated when compiling for host. /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i32, ncclFuncReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads),In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i64, ncclFuncReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i32, ncclFuncReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i32, ncclFuncReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i32, ncclFuncReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i64, ncclFuncReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i64, ncclFuncReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i64, ncclFuncReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i32, ncclFuncReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadI/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i32, ncclFuncReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ dx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i32, ncclFuncReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i64, ncclFuncReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.In file included from comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i64, ncclFuncReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i64, ncclFuncReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBloc9 warnings generated when compiling for host. k(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i8, ncclFuncReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 77 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_SumPostDiv_i8, ncclFuncReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElemen 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i8, ncclFuncReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ t().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i8, ncclFuncReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i8, ncclFuncReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 77 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_SumPostDiv_i8, ncclFuncReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i8, ncclFuncReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i8, ncclFuncReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i8, ncclFuncReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i8, ncclFuncReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i8, ncclFuncReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i8, ncclFuncReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i8, ncclFuncReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i8, ncclFuncReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i8, ncclFuncReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i8, ncclFuncReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ : /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i8, ncclFuncReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* 46 | static long log2i(long n) { | ^~~~~ ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i8, ncclFuncReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 9 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u32, ncclFuncReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u32, ncclFuncReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u32, ncclFuncReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u32, ncclFuncReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u32, ncclFuncReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 77 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_SumPostDiv_u32, ncclFuncReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u32, ncclFuncReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u32, ncclFuncReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u32, ncclFuncReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u32, ncclFuncReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u32, ncclFuncReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u32, ncclFuncReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u32, ncclFuncReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u32, ncclFuncReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 77 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_SumPostDiv_u32, ncclFuncReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u32, ncclFuncReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u32, ncclFuncReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 9 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 9 warnings generated when compiling for gfx1101. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 9 warnings generated when compiling for gfx1101. 9 warnings generated when compiling for gfx1100. /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u32, ncclFuncReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 9 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 9 warnings generated when compiling for gfx1100. 9 warnings generated when compiling for gfx1200. 9 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 9 warnings generated when compiling for gfx1200. 9 warnings generated when compiling for gfx1201. 9 warnings generated when compiling for gfx1201. 10 warnings generated when compiling for gfx90a. 10 warnings generated when compiling for gfx90a. 10 warnings generated when compiling for gfx90a. 10 warnings generated when compiling for gfx90a. [ 98%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_sumpostdiv_u64.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60300 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_sumpostdiv_u64.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_sumpostdiv_u64.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_sumpostdiv_u64.cpp.o -c /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp [ 99%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_sumpostdiv_u8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60300 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_sumpostdiv_u8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_sumpostdiv_u8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_sumpostdiv_u8.cpp.o -c /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ 9 warnings generated when compiling for gfx1201. 9 warnings generated when compiling for gfx1101. 9 warnings generated when compiling for gfx1102. 9 warnings generated when compiling for gfx1200. 9 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 9 warnings generated when compiling for gfx1201. 9 warnings generated when compiling for gfx1100. 9 warnings generated when compiling for gfx1102. 9 warnings generated when compiling for gfx1200. 9 warnings generated when compiling for gfx1101. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u64, ncclFuncReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 10 warnings generated when compiling for gfx90a. /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 77 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_SumPostDiv_u64, ncclFuncReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u64, ncclFuncReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 10 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u8, ncclFuncReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u64, ncclFuncReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u64, ncclFuncReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u64, ncclFuncReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u8, ncclFuncReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 77 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_SumPostDiv_u8, ncclFuncReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u64, ncclFuncReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u8, ncclFuncReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u8, ncclFuncReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u8, ncclFuncReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u8, ncclFuncReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 77 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_SumPostDiv_u64, ncclFuncReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u8, ncclFuncReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u64, ncclFuncReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u64, ncclFuncReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:506:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 504 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 505 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 506 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 77 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 1, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_SumPostDiv_u8, ncclFuncReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_LL128) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u64, ncclFuncReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(gr/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here oup), | ^~~~~~~~~~~ 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u8, ncclFuncReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u64, ncclFuncReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.hIn file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 2>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u8, ncclFuncReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidI[ 99%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/sendrecv_sum_i8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60300 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/sendrecv_sum_i8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/sendrecv_sum_i8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/sendrecv_sum_i8.cpp.o -c /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp nBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u8, ncclFuncReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u64, ncclFuncReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u64, ncclFuncReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u8, ncclFuncReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumP/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.ho:s667t:D15:i vwarning: initializer order does not match the declaration order [-Wreorder-ctor]_ u64, ncclFun c667R | e d u c et,i d(Ftiud)n,c nStuhmrePaodsst(nDtihvr,e audisn)t,6 4t_itdI, nNBClCoLc_kAL(GtOh_rReIaNdGI,d xN.CxC), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u64, ncclFuncReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), L_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u8, ncclFuncReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u8, ncclFuncReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u8, ncclFuncReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u8, ncclFuncReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 9 warnings generated when compiling for host. /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u64, ncclFuncReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ :667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidIn/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg,Block(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u64, ncclFuncReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:203:66: note: in instantiation of member function 'RunWorkElement, 1, 2, 4>::run' requested here 203 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u8, ncclFuncReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 9 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 10 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 10 warnings generated when compiling for gfx90a. [ 99%] Building CXX object CMakeFiles/rccl.dir/git_version.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60300 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/git_version.cpp.o -MF CMakeFiles/rccl.dir/git_version.cpp.o.d -o CMakeFiles/rccl.dir/git_version.cpp.o -c /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/git_version.cpp In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/sendrecv.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/sendrecv.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/sendrecv.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/sendrecv.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/sendrecv.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/sendrecv.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/sendrecv.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/sendrecv.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/sendrecv.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/sendrecv.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/sendrecv.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/sendrecv.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/sendrecv.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/sendrecv.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/sendrecv.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/sendrecv.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/sendrecv.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/sendrecv.h:147:62: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1, 2>, 1>::Primitives' requested here 147 | Primitives, 0, Proto, 1> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/sendrecv.h:214:9: note: in instantiation of function template specialization 'RunWork, 1, 2, 2>::runRecv>' requested here 214 | runRecv>(tid, nthreads, group, args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:3:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 3 | DEFINE_ncclDevFunc(SendRecv_RING_SIMPLE_Sum_i8, ncclFuncSendRecv, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/sendrecv.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/sendrecv.h:147:62: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1, 2>, 1>::Primitives' requested here 147 | Primitives, 0, Proto, 1> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/sendrecv.h:214:9: note: in instantiation of function template specialization 'RunWork, 1, 2, 2>::runRecv>' requested here 214 | runRecv>(tid, nthreads, group, args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:3:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 3 | DEFINE_ncclDevFunc(SendRecv_RING_SIMPLE_Sum_i8, ncclFuncSendRecv, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/sendrecv.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/sendrecv.h:147:62: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1, 2>, 1>::Primitives' requested here 147 | Primitives, 0, Proto, 1> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/sendrecv.h:214:9: note: in instantiation of function template specialization 'RunWork, 1, 2, 2>::runRecv>' requested here 214 | runRecv>(tid, nthreads, group, args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:3:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 3 | DEFINE_ncclDevFunc(SendRecv_RING_SIMPLE_Sum_i8, ncclFuncSendRecv, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/sendrecv.h:86:62: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1, 2>, 1>::Primitives' requested here 86 | Primitives, 0, ProtIn file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/sendrecv.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/sendrecv.h:147:62: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1, 8>, 1>::Primitives' requested here 147 | Primitives, 0, Proto, 1> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/sendrecv.h:210:9: note: in instantiation of function template specialization 'RunWork, 1, 2, 2>::runRecv>' requested here 210 | runRecv>(tid, nthreads, group, args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:3:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 3 | DEFINE_ncclDevFunc(SendRecv_RING_SIMPLE_Sum_i8, ncclFuncSendRecv, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ o, 1> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/sendrecv.h:226:9: note: in instantiation of function template specialization 'RunWork, 1, 2, 2>::runSend>' requested here 226 | runSend>(tid, nthreads, group, args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:3:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 3 | DEFINE_ncclDevFunc(SendRecv_RING_SIMPLE_Sum_i8, ncclFuncSendRecv, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/sendrecv.h:147:62: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1, 4>, 1>::Primitives' requested here 147 | Primitives, 0, Proto, 1> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/sendrecv.h:214:9: note: in instantiation of function template specialization 'RunWork, 1, 2, 4>::runRecv>' requested here 214 | runRecv>(tid, nthreads, group, args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:3:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 3 | DEFINE_ncclDevFunc(SendRecv_RING_SIMPLE_Sum_i8, ncclFuncSendRecv, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/sendrecv.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/sendrecv.h:147:62: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1, 2>, 1>::Primitives' requested here 147 | Primitives, 0, Proto, 1> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/sendrecv.h:214:9: note: in instantiation of function template specialization 'RunWork, 1, 2, 2>::runRecv>' requested here 214 | runRecv>(tid, nthreads, group, args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:3:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 3 | DEFINE_ncclDevFunc(SendRecv_RING_SIMPLE_Sum_i8, ncclFuncSendRecv, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/sendrecv.h:86:62: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1, 2>, 1>::Primitives' requested here 86 | Primitives, 0, Proto, 1> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/sendrecv.h:226:9: note: in instantiation of function template specialization 'RunWork, 1, 2, 2>::runSend>' requested here 226 | runSend>(tid, nthreads, group, args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:3:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 3 | DEFINE_ncclDevFunc(SendRecv_RING_SIMPLE_Sum_i8, ncclFuncSendRecv, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/sendrecv.h:86:62: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1, 2>, 1>::Primitives' requested here 86 | Primitives, 0, Proto, 1> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/sendrecv.h:226:9: note: in instantiation of function template specialization 'RunWork, 1, 2, 2>::runSend>' requested here 226 | runSend>(tid, nthreads, group, args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:3:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 3 | DEFINE_ncclDevFunc(SendRecv_RING_SIMPLE_Sum_i8, ncclFuncSendRecv, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 9 warnings generated when compiling for gfx1100. /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), t/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/sendrecv.h:86:62: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1, 8>, 1>::Primitives' requested here 86 | Primitives, 0, Proto, 1> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/sendrecv.h:222:9: note: in instantiation of function template specialization 'RunWork, 1, 2, 2>::runSend>' requested here 222 | runSend>(tid, nthreads, group, args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:3:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 3 | DEFINE_ncclDevFunc(SendRecv_RING_SIMPLE_Sum_i8, ncclFuncSendRecv, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ idInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/sendrecv.h:86:62: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1, 4>, 1>::Primitives' requested here 86 | Primitives, 0, Proto, 1> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/sendrecv.h:226:9: note: in instantiation of function template specialization 'RunWork, 1, 2, 4>::runSend>' requested here 226 | runSend>(tid, nthreads, group, args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:3:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 3 | DEFINE_ncclDevFunc(SendRecv_RING_SIMPLE_Sum_i8, ncclFuncSendRecv, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/sendrecv.h:147:62: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1, 4>, 1>::Primitives' requested here 147 | Primitives, 0, Proto, 1> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/sendrecv.h:214:9: note: in instantiation of function template specialization 'RunWork, 1, 2, 4>::runRecv>' requested here 214 | runRecv>(tid, nthreads, group, args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:3:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 3 | DEFINE_ncclDevFunc(SendRecv_RING_SIMPLE_Sum_i8, ncclFuncSendRecv, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/sendrecv.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/sendrecv.h:147:62: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1, 2>, 1>::Primitives' requested here 147 | Primitives, 0, Proto, 1> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/sendrecv.h:214:9: note: in instantiation of function template specialization 'RunWork, 1, 2, 2>::runRecv>' requested here 214 | runRecv>(tid, nthreads, group, args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:3:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 3 | DEFINE_ncclDevFunc(SendRecv_RING_SIMPLE_Sum_i8, ncclFuncSendRecv, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: 9 warnings generated when compiling for gfx1101. note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/sendrecv.h:147:62: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1, 4>, 1>::Primitives' requested here 147 | Primitives, 0, Proto, 1> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/sendrecv.h:214:9: note: in instantiation of function template specialization 'RunWork, 1, 2, 4>::runRecv>' requested here 214 | runRecv>(tid, nthreads, group, args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:3:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 3 | DEFINE_ncclDevFunc(SendRecv_RING_SIMPLE_Sum_i8, ncclFuncSendRecv, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/sendrecv.h:86:62: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1, 2>, 1>::Primitives' requested here 86 | Primitives, 0, Proto, 1> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/sendrecv.h:226:9: note: in instantiation of function template specialization 'RunWork, 1, 2, 2>::runSend>' requested here 226 | runSend>(tid, nthreads, group, args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:3:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 3 | DEFINE_ncclDevFunc(SendRecv_RING_SIMPLE_Sum_i8, ncclFuncSendRecv, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nt9 warnings generated when compiling for gfx1201. hreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/sendrecv.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/sendrecv.h:147:62: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1, 8>, 1>::Primitives' requested here 147 | Primitives, 0, Proto, 1> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/sendrecv.h:210:9: note: in instantiation of function template specialization 'RunWork, 1, 2, 2>::runRecv>' requested here 210 | runRecv>(tid, nthreads, group, args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:3:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 3 | DEFINE_ncclDevFunc(SendRecv_RING_SIMPLE_Sum_i8, ncclFuncSendRecv, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/sendrecv.h:86:62: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1, 8>, 1>::Primitives' requested here 86 | Primitives, 0,/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/sendrecv.h:86:62: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1, 4>, 1>::Primitives' requested here 86 | Primitives, 0, Proto, 1> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/sendrecv.h:226:9: note: in instantiation of function template specialization 'RunWork, 1, 2, 4>::runSend>' requested here 226 | runSend>(tid, nthreads, group, args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:3:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 3 | DEFINE_ncclDevFunc(SendRecv_RING_SIMPLE_Sum_i8, ncclFuncSendRecv, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ Proto, 1> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/sendrecv.h:222:9: note: in instantiation of function template specialization 'RunWork, 1, 2, 2>::runSend>' requested here 222 | runSend>(tid, nthreads, group, args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:3:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 3 | DEFINE_ncclDevFunc(SendRecv_RING_SIMPLE_Sum_i8, ncclFuncSendRecv, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/sendrecv.h:147:62: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1, 4>, 1>::Primitives' requested here 147 | Primitives, 0, Proto, 1> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/sendrecv.h:214:9: note: in instantiation of function template specialization 'RunWork, 1, 2, 4>::runRecv>' requested here 214 | runRecv>(tid, nthreads, group, args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:3:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 3 | DEFINE_ncclDevFunc(SendRecv_RING_SIMPLE_Sum_i8, ncclFuncSendRecv, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/sendrecv.h:86:62: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1, 2>, 1>::Primitives' requested here 86 | Primitives, 0, Proto, 1> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/sendrecv.h:226:9: note: in instantiation of function template specialization 'RunWork, 1, 2, 2>::runSend>' requested here 226 | runSend>(tid, nthreads, group, args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:3:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 3 | DEFINE_ncclDevFunc(SendRecv_RING_SIMPLE_Sum_i8, ncclFuncSendRecv, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidI/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.hnBl:o667c:k15(:t hwarning: rinitializer order does not match the declaration order [-Wreorder-ctor]e adIdx.x), group(group), | ^~~~~~~~~~~ 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/sendrecv.h:86:62: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1, 4>, 1>::Primitives' requested here 86 | Primitives, 0, Proto, 1> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/sendrecv.h:226:9: note: in instantiation of function template specialization 'RunWork, 1, 2, 4>::runSend>' requested here 226 | runSend>(tid, nthreads, group, args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:3:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 3 | DEFINE_ncclDevFunc(SendRecv_RING_SIMPLE_Sum_i8, ncclFuncSendRecv, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/sendrecv.h:10: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/sendrecv.h:147:62: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1, 2>, 1>::Primitives' requested here 147 | Primitives, 0, Proto, 1> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/sendrecv.h:214:9: note: in instantiation of function template specialization 'RunWork, 1, 2, 2>::runRecv>' requested here 214 | runRecv>(tid, nthreads, group, args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:3:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 3 | DEFINE_ncclDevFunc(SendRecv_RING_SIMPLE_Sum_i/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffS8iz,e sn[cNcClCFLu_PnRcOSTeOn_dSRIeMcPvL,E ]F/uNnCcCSLu_mS,T EiPnSt/8s_itz,e oNfC(CTL)_ A:L GsOt_eRpISNGi,z eN_CC)L _{P R O| T ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~O _ S| I group(groupM PLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/sendrecv.h406::14752::62 :note: expanded from macro 'DEFINE_ncclDevFunc'note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1, 4>, 1>::Primitives' requested here 147 | 406 | RPunrWiomriktA,s yamlmgeot,r ipcr2,> (0).,r uPnr(o&tnoc,c l1S>h mpermi.mwso r k| ) ^; \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/sendrecv.h:214:9: note: in instantiation of function template specialization 'RunWork, 1, 2, 4>::runRecv>' requested here /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 214 | r u667n | R e c vtd>I(ntBildo,ck (nthtrheraedaIddsx,. xg)r,o ugpro,u pa(rggrso)u;p ) ,| ^ | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp60::3 :note: 1field 'group' will be initialized after field 'stepSize': note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 667 | t i3d | (tDiEdF)I,N En_tnhcclrDeeavdFsu(nnct(hSreenaddRse)c,v _tRiIdNIGn_BSlIoMcPkL(Et_hSruema_diI8d,x .nxc)c,l FgurnocuSpe(ngdrRoeucpv),, F u| n ^~~~~~~~~~~c Sum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/sendrecv.h:86:62: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1, 4>, 1>::Primitives' requested here 86 | Primitives, 0, Proto, 1> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/sendrecv.h:226:9: note: in instantiation of function template specialization 'RunWork, 1, 2, 4>::runSend>' requested here 226 | runSend>(tid, nthreads, group, args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:3:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 3 | DEFINE_ncclDevFunc(SendRecv_RING_SIMPLE_Sum_i8, ncclFuncSendRecv, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 9 warnings generated when compiling for gfx1200. /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/sendrecv.h:86:62: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1, 4>, 1>::Primitives' requested here 86 | Primitives, 0, Proto, 1> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/sendrecv.h:226:9: note: in instantiation of function template specialization 'RunWork, 1, 2, 4>::runSend>' requested here 226 | runSend>(tid, nthreads, group, args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:3:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 3 | DEFINE_ncclDevFunc(SendRecv_RING_SIMPLE_Sum_i8, ncclFuncSendRecv, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 11 warnings generated when compiling for host. /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/sendrecv.h:86:62: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1, 2>, 1>::Primitives' requested here 86 | Primitives, 0, Proto, 1> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/sendrecv.h:226:9: note: in instantiation of function template specialization 'RunWork, 1, 2, 2>::runSend>' requested here 226 | runSend>(tid, nthreads, group, args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:3:1: note: in instantiation of member function 'RunWork, 1, 2, 2>::run' requested here 3 | DEFINE_ncclDevFunc(SendRecv_RING_SIMPLE_Sum_i8, ncclFuncSendRecv, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:406:52: note: expanded from macro 'DEFINE_ncclDevFunc' 406 | RunWork, algo, proto, 2>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/sendrecv.h:147:62: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1, 4>, 1>::Primitives' requested here 147 | Primitives, 0, Proto, 1> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/sendrecv.h:214:9: note: in instantiation of function template specialization 'RunWork, 1, 2, 4>::runRecv>' requested here 214 | runRecv>(tid, nthreads, group, args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:3:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 3 | DEFINE_ncclDevFunc(SendRecv_RING_SIMPLE_Sum_i8, ncclFuncSendRecv, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 9 warnings generated when compiling for gfx1102. /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 668 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/sendrecv.h:86:62: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1, 4>, 1>::Primitives' requested here 86 | Primitives, 0, Proto, 1> prims | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/sendrecv.h:226:9: note: in instantiation of function template specialization 'RunWork, 1, 2, 4>::runSend>' requested here 226 | runSend>(tid, nthreads, group, args); | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:3:1: note: in instantiation of member function 'RunWork, 1, 2, 4>::run' requested here 3 | DEFINE_ncclDevFunc(SendRecv_RING_SIMPLE_Sum_i8, ncclFuncSendRecv, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:409:52: note: expanded from macro 'DEFINE_ncclDevFunc' 409 | RunWork, algo, proto, 4>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/prims_simple.h:667:60: note: field 'group' will be initialized after field 'stepSize' 667 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 10 warnings generated when compiling for gfx90a. 10 warnings generated when compiling for gfx90a. 9 warnings generated when compiling for gfx1101. 9 warnings generated when compiling for gfx1100. 9 warnings generated when compiling for gfx1200. 9 warnings generated when compiling for gfx1201. 9 warnings generated when compiling for gfx1102. 10 warnings generated when compiling for gfx90a. 10 warnings generated when compiling for gfx90a. 9 warnings generated when compiling for gfx90a. 9 warnings generated when compiling for gfx90a. 11 warnings generated when compiling for gfx1200. 11 warnings generated when compiling for gfx1102. 11 warnings generated when compiling for gfx1201. 11 warnings generated when compiling for gfx1101. 11 warnings generated when compiling for gfx1100. [100%] Linking CXX shared library librccl.so /usr/bin/cmake -E cmake_link_script CMakeFiles/rccl.dir/link.txt --verbose=1 /usr/bin/cmake -E time /usr/bin/hipcc -fPIC -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -parallel-jobs=1 -Xoffload-linker -mllvm=-amdgpu-kernarg-preload-count=16 -Xlinker --dependency-file=CMakeFiles/rccl.dir/link.d -Wl,-z,relro -Wl,--as-needed -Wl,-z,pack-relative-relocs -Wl,-z,now -Wl,-z,now -Wl,--build-id=sha1 -specs=/usr/lib/rpm/redhat/redhat-package-notes -shared -Wl,-soname,librccl.so.1 -o librccl.so.1.0 CMakeFiles/rccl.dir/hipify/src/bootstrap.cc.o CMakeFiles/rccl.dir/hipify/src/channel.cc.o CMakeFiles/rccl.dir/hipify/src/collectives.cc.o CMakeFiles/rccl.dir/hipify/src/debug.cc.o CMakeFiles/rccl.dir/hipify/src/enqueue.cc.o CMakeFiles/rccl.dir/hipify/src/group.cc.o CMakeFiles/rccl.dir/hipify/src/init.cc.o CMakeFiles/rccl.dir/hipify/src/init_nvtx.cc.o CMakeFiles/rccl.dir/hipify/src/net.cc.o CMakeFiles/rccl.dir/hipify/src/msccl.cc.o CMakeFiles/rccl.dir/hipify/src/proxy.cc.o CMakeFiles/rccl.dir/hipify/src/register.cc.o CMakeFiles/rccl.dir/hipify/src/transport.cc.o CMakeFiles/rccl.dir/hipify/src/device/common.cu.cpp.o CMakeFiles/rccl.dir/hipify/src/device/onerank.cu.cpp.o CMakeFiles/rccl.dir/hipify/src/graph/connect.cc.o CMakeFiles/rccl.dir/hipify/src/graph/paths.cc.o CMakeFiles/rccl.dir/hipify/src/graph/rings.cc.o CMakeFiles/rccl.dir/hipify/src/graph/rome_models.cc.o CMakeFiles/rccl.dir/hipify/src/graph/search.cc.o CMakeFiles/rccl.dir/hipify/src/graph/topo.cc.o CMakeFiles/rccl.dir/hipify/src/graph/trees.cc.o CMakeFiles/rccl.dir/hipify/src/graph/tuning.cc.o CMakeFiles/rccl.dir/hipify/src/graph/xml.cc.o CMakeFiles/rccl.dir/hipify/src/misc/alt_rsmi.cc.o CMakeFiles/rccl.dir/hipify/src/misc/archinfo.cc.o CMakeFiles/rccl.dir/hipify/src/misc/argcheck.cc.o CMakeFiles/rccl.dir/hipify/src/misc/api_trace.cc.o CMakeFiles/rccl.dir/hipify/src/misc/ibvsymbols.cc.o CMakeFiles/rccl.dir/hipify/src/misc/ibvwrap.cc.o CMakeFiles/rccl.dir/hipify/src/misc/ipcsocket.cc.o CMakeFiles/rccl.dir/hipify/src/misc/npkit.cc.o CMakeFiles/rccl.dir/hipify/src/misc/nvmlwrap_stub.cc.o CMakeFiles/rccl.dir/hipify/src/misc/param.cc.o CMakeFiles/rccl.dir/hipify/src/misc/profiler.cc.o CMakeFiles/rccl.dir/hipify/src/misc/rocm_smi_wrap.cc.o CMakeFiles/rccl.dir/hipify/src/misc/rocmwrap.cc.o CMakeFiles/rccl.dir/hipify/src/misc/roctx.cc.o CMakeFiles/rccl.dir/hipify/src/misc/shmutils.cc.o CMakeFiles/rccl.dir/hipify/src/misc/signals.cc.o CMakeFiles/rccl.dir/hipify/src/misc/socket.cc.o CMakeFiles/rccl.dir/hipify/src/misc/strongstream.cc.o CMakeFiles/rccl.dir/hipify/src/misc/tuner.cc.o CMakeFiles/rccl.dir/hipify/src/misc/utils.cc.o CMakeFiles/rccl.dir/hipify/src/misc/msccl/msccl_lifecycle.cc.o CMakeFiles/rccl.dir/hipify/src/misc/msccl/msccl_parser.cc.o CMakeFiles/rccl.dir/hipify/src/misc/msccl/msccl_setup.cc.o CMakeFiles/rccl.dir/hipify/src/misc/msccl/msccl_status.cc.o CMakeFiles/rccl.dir/hipify/src/transport/coll_net.cc.o CMakeFiles/rccl.dir/hipify/src/transport/net_tmp.cc.o CMakeFiles/rccl.dir/hipify/src/transport/net_ib.cc.o CMakeFiles/rccl.dir/hipify/src/transport/net_socket.cc.o CMakeFiles/rccl.dir/hipify/src/transport/nvls.cc.o CMakeFiles/rccl.dir/hipify/src/transport/p2p.cc.o CMakeFiles/rccl.dir/hipify/src/transport/shm.cc.o CMakeFiles/rccl.dir/hipify/gensrc/all_gather_sum_i8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_minmax_bf16.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_minmax_bf8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_minmax_f16.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_minmax_f32.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_minmax_f64.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_minmax_f8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_minmax_u32.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_minmax_u64.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_minmax_u8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_premulsum_bf16.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_premulsum_bf8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_premulsum_f16.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_premulsum_f32.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_premulsum_f64.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_premulsum_f8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_premulsum_u32.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_premulsum_u64.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_premulsum_u8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_prod_bf16.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_prod_bf8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_prod_f16.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_prod_f32.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_prod_f64.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_prod_f8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_prod_u32.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_prod_u64.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_prod_u8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sum_bf16.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sum_bf8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sum_f16.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sum_f32.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sum_f64.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sum_f8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sum_u32.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sum_u64.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sum_u8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/alltoall_pivot_sum_i8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/broadcast_sum_i8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/device_table.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/host_table.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_f16.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_f32.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_f64.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_f8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_i32.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_i64.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_i8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_u32.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_u64.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_u8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_bf16.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_bf8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_f16.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_f32.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_f64.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_f8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_i32.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_i64.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_i8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_u32.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_u64.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_u8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_bf16.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_bf8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_f16.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_f32.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_f64.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_f8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_i32.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_i64.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_i8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_u32.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_u64.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_u8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_minmax_bf16.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_minmax_bf8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_minmax_f16.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_minmax_f32.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_minmax_f64.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_minmax_f8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_minmax_u32.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_minmax_u64.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_minmax_u8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_premulsum_bf16.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_premulsum_bf8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_premulsum_f16.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_premulsum_f32.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_premulsum_f64.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_premulsum_f8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_premulsum_u32.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_premulsum_u64.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_premulsum_u8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_prod_bf16.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_prod_bf8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_prod_f16.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_prod_f32.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_prod_f64.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_prod_f8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_prod_u32.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_prod_u64.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_prod_u8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_minmax_bf16.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_minmax_bf8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_minmax_f16.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_minmax_f32.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_minmax_f64.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_minmax_f8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_minmax_u32.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_minmax_u64.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_minmax_u8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_premulsum_f16.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_premulsum_f32.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_premulsum_f64.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_premulsum_f8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_premulsum_u32.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_premulsum_u64.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_premulsum_u8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_prod_bf16.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_prod_bf8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_prod_f16.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_prod_f32.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_prod_f64.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_prod_f8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_prod_u32.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_prod_u64.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_prod_u8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sum_bf16.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sum_bf8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sum_f16.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sum_f32.cpp.o CMakeFiles/rccl.dir/hipify/gclang++: warning: argument unused during compilation: '-Xarch_host -fstack-protector-strong' [-Wunused-command-line-argument] clang++: warning: argument unused during compilation: '-Xarch_host -fcf-protection' [-Wunused-command-line-argument] clang++: warning: argument unused during compilation: '-specs=/usr/lib/rpm/redhat/redhat-package-notes' [-Wunused-command-line-argument] Elapsed time (seconds): 2300.29 ensrc/reduce_scatter_sum_f64.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sum_f8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sum_u32.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sum_u64.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sum_u8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_sum_bf16.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_sum_bf8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_sum_f16.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_sum_f32.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_sum_f64.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_sum_f8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_sum_u32.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_sum_u64.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_sum_u8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_sumpostdiv_i32.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_sumpostdiv_i64.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_sumpostdiv_i8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_sumpostdiv_u32.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_sumpostdiv_u64.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_sumpostdiv_u8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/sendrecv_sum_i8.cpp.o CMakeFiles/rccl.dir/git_version.cpp.o -fgpu-rdc -ldl /usr/lib64/librocm_smi64.so.1.0 /usr/lib64/libamdhip64.so.6.3.42133 --hip-link --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -lpthread -lrt -ldl /usr/bin/cmake -E cmake_symlink_library librccl.so.1.0 librccl.so.1 librccl.so gmake[2]: Leaving directory '/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build' [100%] Built target rccl gmake[1]: Leaving directory '/builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build' /usr/bin/cmake -E cmake_progress_start /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/redhat-linux-build/CMakeFiles 0 + RPM_EC=0 ++ jobs -p + exit 0 Executing(%install): /bin/sh -e /var/tmp/rpm-tmp.pwUHmX + umask 022 + cd /builddir/build/BUILD/rccl-6.3.0-build + '[' /builddir/build/BUILD/rccl-6.3.0-build/BUILDROOT '!=' / ']' + rm -rf /builddir/build/BUILD/rccl-6.3.0-build/BUILDROOT ++ dirname /builddir/build/BUILD/rccl-6.3.0-build/BUILDROOT + mkdir -p /builddir/build/BUILD/rccl-6.3.0-build + mkdir /builddir/build/BUILD/rccl-6.3.0-build/BUILDROOT + CFLAGS='-O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer ' + export CFLAGS + CXXFLAGS='-O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer' + export CXXFLAGS + FFLAGS='-O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -I/usr/lib64/gfortran/modules ' + export FFLAGS + FCFLAGS='-O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -I/usr/lib64/gfortran/modules ' + export FCFLAGS + VALAFLAGS=-g + export VALAFLAGS + RUSTFLAGS='-Copt-level=3 -Cdebuginfo=2 -Ccodegen-units=1 -Cstrip=none -Cforce-frame-pointers=yes -Clink-arg=-specs=/usr/lib/rpm/redhat/redhat-package-notes --cap-lints=warn' + export RUSTFLAGS + LDFLAGS='-Wl,-z,relro -Wl,--as-needed -Wl,-z,pack-relative-relocs -Wl,-z,now -Wl,-z,now -Wl,--build-id=sha1 -specs=/usr/lib/rpm/redhat/redhat-package-notes ' + export LDFLAGS + LT_SYS_LIBRARY_PATH=/usr/lib64: + export LT_SYS_LIBRARY_PATH + CC=hipcc + export CC + CXX=hipcc + export CXX + cd rccl-rocm-6.3.0 + DESTDIR=/builddir/build/BUILD/rccl-6.3.0-build/BUILDROOT + /usr/bin/cmake --install redhat-linux-build -- Install configuration: "RelWithDebInfo" -- Installing: /builddir/build/BUILD/rccl-6.3.0-build/BUILDROOT/usr/lib64/librccl.so.1.0 -- Installing: /builddir/build/BUILD/rccl-6.3.0-build/BUILDROOT/usr/lib64/librccl.so.1 -- Installing: /builddir/build/BUILD/rccl-6.3.0-build/BUILDROOT/usr/lib64/librccl.so -- Installing: /builddir/build/BUILD/rccl-6.3.0-build/BUILDROOT/usr/include/rccl/rccl.h -- Installing: /builddir/build/BUILD/rccl-6.3.0-build/BUILDROOT/usr/include/rccl/nccl_net.h -- Installing: /builddir/build/BUILD/rccl-6.3.0-build/BUILDROOT/usr/include/rccl/amd_detail/api_trace.h -- Installing: /builddir/build/BUILD/rccl-6.3.0-build/BUILDROOT/usr/share/rccl/msccl-algorithms -- Installing: /builddir/build/BUILD/rccl-6.3.0-build/BUILDROOT/usr/share/rccl/msccl-algorithms/allreduce-allpairs-8n-ll-32tb-op.xml -- Installing: /builddir/build/BUILD/rccl-6.3.0-build/BUILDROOT/usr/share/rccl/msccl-algorithms/allreduce-allpairs-8n-ll-32tb.xml -- Installing: /builddir/build/BUILD/rccl-6.3.0-build/BUILDROOT/usr/share/rccl/msccl-algorithms/allreduce-allpairs-8n-ll-64tb-op.xml -- Installing: /builddir/build/BUILD/rccl-6.3.0-build/BUILDROOT/usr/share/rccl/msccl-algorithms/allreduce-allpairs-8n-ll-64tb.xml -- Installing: /builddir/build/BUILD/rccl-6.3.0-build/BUILDROOT/usr/share/rccl/msccl-algorithms/allreduce-allpairs-8n-simple-op.xml -- Installing: /builddir/build/BUILD/rccl-6.3.0-build/BUILDROOT/usr/share/rccl/msccl-algorithms/allreduce-allpairs-8n-simple.xml -- Installing: /builddir/build/BUILD/rccl-6.3.0-build/BUILDROOT/usr/share/rccl/msccl-algorithms/allreduce-allpairs-8n-simple_2.xml -- Installing: /builddir/build/BUILD/rccl-6.3.0-build/BUILDROOT/usr/share/rccl/msccl-algorithms/alltoall-8n-0-9kb.xml -- Installing: /builddir/build/BUILD/rccl-6.3.0-build/BUILDROOT/usr/share/rccl/msccl-algorithms/alltoall-8n-190kb-512kb.xml -- Installing: /builddir/build/BUILD/rccl-6.3.0-build/BUILDROOT/usr/share/rccl/msccl-algorithms/alltoall-8n-512kb-7mb.xml -- Installing: /builddir/build/BUILD/rccl-6.3.0-build/BUILDROOT/usr/share/rccl/msccl-algorithms/alltoall-8n-7mb-43mb.xml -- Installing: /builddir/build/BUILD/rccl-6.3.0-build/BUILDROOT/usr/share/rccl/msccl-algorithms/alltoall-8n-9kb-190kb.xml -- Installing: /builddir/build/BUILD/rccl-6.3.0-build/BUILDROOT/usr/share/rccl/msccl-unit-test-algorithms -- Installing: /builddir/build/BUILD/rccl-6.3.0-build/BUILDROOT/usr/share/rccl/msccl-unit-test-algorithms/all-reduce-ring-ll.xml -- Installing: /builddir/build/BUILD/rccl-6.3.0-build/BUILDROOT/usr/share/rccl/msccl-unit-test-algorithms/all-reduce-ring-ll128.xml -- Installing: /builddir/build/BUILD/rccl-6.3.0-build/BUILDROOT/usr/share/rccl/msccl-unit-test-algorithms/all-reduce-ring-simple.xml -- Installing: /builddir/build/BUILD/rccl-6.3.0-build/BUILDROOT/usr/lib64/cmake/rccl/rccl-targets.cmake -- Installing: /builddir/build/BUILD/rccl-6.3.0-build/BUILDROOT/usr/lib64/cmake/rccl/rccl-targets-relwithdebinfo.cmake -- Installing: /builddir/build/BUILD/rccl-6.3.0-build/BUILDROOT/usr/lib64/cmake/rccl/rccl-config.cmake -- Installing: /builddir/build/BUILD/rccl-6.3.0-build/BUILDROOT/usr/lib64/cmake/rccl/rccl-config-version.cmake -- Installing: /builddir/build/BUILD/rccl-6.3.0-build/BUILDROOT/usr/share/doc/rccl/LICENSE.txt + echo s@/builddir/build/BUILD/rccl-6.3.0-build/BUILDROOT@@ + find /builddir/build/BUILD/rccl-6.3.0-build/BUILDROOT/usr/lib64 -name '*.so.*.[0-9]' + sed -f br.sed + find /builddir/build/BUILD/rccl-6.3.0-build/BUILDROOT/usr/lib64 -name '*.so.[0-9]' + sed -f br.sed + find /builddir/build/BUILD/rccl-6.3.0-build/BUILDROOT/usr/lib64 -name '*.so' + sed -f br.sed + find /builddir/build/BUILD/rccl-6.3.0-build/BUILDROOT/usr/lib64 -name '*.cmake' + sed -f br.sed + '[' -f /builddir/build/BUILD/rccl-6.3.0-build/BUILDROOT/usr/share/doc/rccl/LICENSE.txt ']' + rm /builddir/build/BUILD/rccl-6.3.0-build/BUILDROOT/usr/share/doc/rccl/LICENSE.txt + /usr/bin/find-debuginfo -j4 --strict-build-id -m -i --build-id-seed 6.3.0-3.fc42 --unique-debug-suffix -6.3.0-3.fc42.x86_64 --unique-debug-src-base rccl-6.3.0-3.fc42.x86_64 --run-dwz --dwz-low-mem-die-limit 10000000 --dwz-max-die-limit 110000000 -S debugsourcefiles.list /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0 find-debuginfo: starting Extracting debug info from 1 files DWARF-compressing 1 files dwz: ./usr/lib64/librccl.so.1.0-6.3.0-3.fc42.x86_64.debug: Unknown debugging section .debug_str_offsets sepdebugcrcfix: Updated 0 CRC32s, 1 CRC32s did match. Creating .debug symlinks for symlinks to ELF files Copying sources found by 'debugedit -l' to /usr/src/debug/rccl-6.3.0-3.fc42.x86_64 find-debuginfo: done + /usr/lib/rpm/check-buildroot + /usr/lib/rpm/redhat/brp-ldconfig + /usr/lib/rpm/brp-compress + /usr/lib/rpm/redhat/brp-strip-lto /usr/bin/strip + /usr/lib/rpm/brp-strip-static-archive /usr/bin/strip + /usr/lib/rpm/check-rpaths + /usr/lib/rpm/redhat/brp-mangle-shebangs + /usr/lib/rpm/brp-remove-la-files + env /usr/lib/rpm/redhat/brp-python-bytecompile '' 1 0 -j4 + /usr/lib/rpm/redhat/brp-python-hardlink + /usr/bin/add-determinism --brp -j4 /builddir/build/BUILD/rccl-6.3.0-build/BUILDROOT Scanned 38 directories and 313 files, processed 0 inodes, 0 modified (0 replaced + 0 rewritten), 0 unsupported format, 0 errors Reading /builddir/build/BUILD/rccl-6.3.0-build/SPECPARTS/rpm-debuginfo.specpart Processing files: rccl-6.3.0-3.fc42.x86_64 Executing(%license): /bin/sh -e /var/tmp/rpm-tmp.0C1xQ0 + umask 022 + cd /builddir/build/BUILD/rccl-6.3.0-build + cd rccl-rocm-6.3.0 + LICENSEDIR=/builddir/build/BUILD/rccl-6.3.0-build/BUILDROOT/usr/share/licenses/rccl + export LC_ALL=C.UTF-8 + LC_ALL=C.UTF-8 + export LICENSEDIR + /usr/bin/mkdir -p /builddir/build/BUILD/rccl-6.3.0-build/BUILDROOT/usr/share/licenses/rccl + cp -pr /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/LICENSE.txt /builddir/build/BUILD/rccl-6.3.0-build/BUILDROOT/usr/share/licenses/rccl + RPM_EC=0 ++ jobs -p + exit 0 Provides: librccl.so.1()(64bit) rccl = 6.3.0-3.fc42 rccl(x86-64) = 6.3.0-3.fc42 Requires(rpmlib): rpmlib(CompressedFileNames) <= 3.0.4-1 rpmlib(FileDigests) <= 4.6.0-1 rpmlib(PayloadFilesHavePrefix) <= 4.0-1 Requires: ld-linux-x86-64.so.2()(64bit) ld-linux-x86-64.so.2(GLIBC_2.3)(64bit) libamdhip64.so.6()(64bit) libamdhip64.so.6(hip_4.2)(64bit) libamdhip64.so.6(hip_4.3)(64bit) libamdhip64.so.6(hip_4.5)(64bit) libamdhip64.so.6(hip_5.0)(64bit) libamdhip64.so.6(hip_5.3)(64bit) libamdhip64.so.6(hip_6.0)(64bit) libc.so.6()(64bit) libc.so.6(GLIBC_2.10)(64bit) libc.so.6(GLIBC_2.14)(64bit) libc.so.6(GLIBC_2.16)(64bit) libc.so.6(GLIBC_2.17)(64bit) libc.so.6(GLIBC_2.2.5)(64bit) libc.so.6(GLIBC_2.3)(64bit) libc.so.6(GLIBC_2.3.2)(64bit) libc.so.6(GLIBC_2.3.4)(64bit) libc.so.6(GLIBC_2.32)(64bit) libc.so.6(GLIBC_2.33)(64bit) libc.so.6(GLIBC_2.34)(64bit) libc.so.6(GLIBC_2.38)(64bit) libc.so.6(GLIBC_2.4)(64bit) libc.so.6(GLIBC_2.6)(64bit) libc.so.6(GLIBC_2.7)(64bit) libc.so.6(GLIBC_ABI_DT_RELR)(64bit) libgcc_s.so.1()(64bit) libgcc_s.so.1(GCC_12.0.0)(64bit) libgcc_s.so.1(GCC_3.0)(64bit) libm.so.6()(64bit) libm.so.6(GLIBC_2.2.5)(64bit) librocm_smi64.so.1()(64bit) libstdc++.so.6()(64bit) libstdc++.so.6(CXXABI_1.3)(64bit) libstdc++.so.6(CXXABI_1.3.7)(64bit) libstdc++.so.6(GLIBCXX_3.4)(64bit) libstdc++.so.6(GLIBCXX_3.4.11)(64bit) libstdc++.so.6(GLIBCXX_3.4.18)(64bit) libstdc++.so.6(GLIBCXX_3.4.19)(64bit) libstdc++.so.6(GLIBCXX_3.4.21)(64bit) libstdc++.so.6(GLIBCXX_3.4.22)(64bit) libstdc++.so.6(GLIBCXX_3.4.26)(64bit) libstdc++.so.6(GLIBCXX_3.4.29)(64bit) libstdc++.so.6(GLIBCXX_3.4.30)(64bit) libstdc++.so.6(GLIBCXX_3.4.32)(64bit) libstdc++.so.6(GLIBCXX_3.4.9)(64bit) Processing files: rccl-devel-6.3.0-3.fc42.x86_64 Executing(%doc): /bin/sh -e /var/tmp/rpm-tmp.2WJpRx + umask 022 + cd /builddir/build/BUILD/rccl-6.3.0-build + cd rccl-rocm-6.3.0 + DOCDIR=/builddir/build/BUILD/rccl-6.3.0-build/BUILDROOT/usr/share/doc/rccl-devel + export LC_ALL=C.UTF-8 + LC_ALL=C.UTF-8 + export DOCDIR + /usr/bin/mkdir -p /builddir/build/BUILD/rccl-6.3.0-build/BUILDROOT/usr/share/doc/rccl-devel + cp -pr /builddir/build/BUILD/rccl-6.3.0-build/rccl-rocm-6.3.0/README.md /builddir/build/BUILD/rccl-6.3.0-build/BUILDROOT/usr/share/doc/rccl-devel + RPM_EC=0 ++ jobs -p + exit 0 Provides: cmake(rccl) = 2.21.5 rccl-devel = 6.3.0-3.fc42 rccl-devel(x86-64) = 6.3.0-3.fc42 Requires(rpmlib): rpmlib(CompressedFileNames) <= 3.0.4-1 rpmlib(FileDigests) <= 4.6.0-1 rpmlib(PayloadFilesHavePrefix) <= 4.0-1 Requires: cmake-filesystem(x86-64) librccl.so.1()(64bit) Processing files: rccl-data-6.3.0-3.fc42.noarch Provides: rccl-data = 6.3.0-3.fc42 Requires(rpmlib): rpmlib(CompressedFileNames) <= 3.0.4-1 rpmlib(FileDigests) <= 4.6.0-1 rpmlib(PayloadFilesHavePrefix) <= 4.0-1 Processing files: rccl-debugsource-6.3.0-3.fc42.x86_64 Provides: rccl-debugsource = 6.3.0-3.fc42 rccl-debugsource(x86-64) = 6.3.0-3.fc42 Requires(rpmlib): rpmlib(CompressedFileNames) <= 3.0.4-1 rpmlib(FileDigests) <= 4.6.0-1 rpmlib(PayloadFilesHavePrefix) <= 4.0-1 Processing files: rccl-debuginfo-6.3.0-3.fc42.x86_64 Provides: debuginfo(build-id) = 793ce1fa451a4cb525fc8869e2558ad127bea978 librccl.so.1.0-6.3.0-3.fc42.x86_64.debug()(64bit) rccl-debuginfo = 6.3.0-3.fc42 rccl-debuginfo(x86-64) = 6.3.0-3.fc42 Requires(rpmlib): rpmlib(CompressedFileNames) <= 3.0.4-1 rpmlib(FileDigests) <= 4.6.0-1 rpmlib(PayloadFilesHavePrefix) <= 4.0-1 Recommends: rccl-debugsource(x86-64) = 6.3.0-3.fc42 Checking for unpackaged file(s): /usr/lib/rpm/check-files /builddir/build/BUILD/rccl-6.3.0-build/BUILDROOT Wrote: /builddir/build/RPMS/rccl-debugsource-6.3.0-3.fc42.x86_64.rpm Wrote: /builddir/build/RPMS/rccl-devel-6.3.0-3.fc42.x86_64.rpm Wrote: /builddir/build/RPMS/rccl-debuginfo-6.3.0-3.fc42.x86_64.rpm Wrote: /builddir/build/RPMS/rccl-data-6.3.0-3.fc42.noarch.rpm Wrote: /builddir/build/RPMS/rccl-6.3.0-3.fc42.x86_64.rpm Executing(rmbuild): /bin/sh -e /var/tmp/rpm-tmp.Wn9nh2 + umask 022 + cd /builddir/build/BUILD/rccl-6.3.0-build + test -d /builddir/build/BUILD/rccl-6.3.0-build + /usr/bin/chmod -Rf a+rX,u+w,g-w,o-w /builddir/build/BUILD/rccl-6.3.0-build + rm -rf /builddir/build/BUILD/rccl-6.3.0-build + RPM_EC=0 ++ jobs -p + exit 0 Finish: rpmbuild rccl-6.3.0-3.fc42.src.rpm Finish: build phase for rccl-6.3.0-3.fc42.src.rpm INFO: chroot_scan: 1 files copied to /var/lib/copr-rpmbuild/results/chroot_scan INFO: /var/lib/mock/fedora-42-x86_64-1741782565.967003/root/var/log/dnf5.log INFO: chroot_scan: creating tarball /var/lib/copr-rpmbuild/results/chroot_scan.tar.gz /bin/tar: Removing leading `/' from member names INFO: Done(/var/lib/copr-rpmbuild/results/rccl-6.3.0-3.fc42.src.rpm) Config(child) 71 minutes 19 seconds INFO: Results and/or logs in: /var/lib/copr-rpmbuild/results INFO: Cleaning up build root ('cleanup_on_success=True') Start: clean chroot INFO: unmounting tmpfs. Finish: clean chroot Finish: run Running RPMResults tool Package info: { "packages": [ { "name": "rccl-debugsource", "epoch": null, "version": "6.3.0", "release": "3.fc42", "arch": "x86_64" }, { "name": "rccl-data", "epoch": null, "version": "6.3.0", "release": "3.fc42", "arch": "noarch" }, { "name": "rccl", "epoch": null, "version": "6.3.0", "release": "3.fc42", "arch": "x86_64" }, { "name": "rccl", "epoch": null, "version": "6.3.0", "release": "3.fc42", "arch": "src" }, { "name": "rccl-debuginfo", "epoch": null, "version": "6.3.0", "release": "3.fc42", "arch": "x86_64" }, { "name": "rccl-devel", "epoch": null, "version": "6.3.0", "release": "3.fc42", "arch": "x86_64" } ] } RPMResults finished